Abstract: We compute trivariate probability distributions in the landscape, scanning
simultaneously over the cosmological constant, the primordial density contrast,
and spatial curvature. We consider two different measures for regulating the divergences
of eternal inflation, and three different models for observers. In one model,
observers are assumed to arise in proportion to the entropy produced by stars; in the
others, they arise at a fixed time (5 or 10 billion years) after star formation. The star
formation rate, which underlies all our observer models, depends sensitively on the
three scanning parameters. We employ a recently developed model of star formation in
the multiverse, a considerable refinement over previous treatments of the astrophysical
and cosmological properties of different pocket universes. For each combination of
observer model and measure, we display all single and bivariate probability distributions,
both with the remaining parameter(s) held fixed, and marginalized. Our results depend
only weakly on the observer model but more strongly on the measure. Using the causal
diamond measure, the observed parameter values (or bounds) lie within the central 2σ
of nearly all probability distributions we compute, and always within 3σ. This success
is encouraging and rather nontrivial, considering the large size and dimension of the
parameter space. The causal patch measure gives similar results as long as curvature is
negligible. If curvature dominates, the causal patch leads to a novel runaway: it prefers
a negative value of the cosmological constant, with the smallest magnitude available in
the landscape.

1. Introduction



String theory appears to give rise to a large vacuum landscape, containing perhaps ten 
to the hundreds of metastable vacua with three large spatial dimensions [1,2]. (See, 
e.g., Ref. [3] for a discussion of earlier work.) Parameters that appear fundamental 
at low energies can vary among these vacua and must be predicted statistically. The 
probability for a particular value is proportional to the expected number of times it is 
observed. In particular, the cosmological constant, Λ, will vary. Thus, the landscape of
string theory provides a theoretical foundation for Weinberg's [4] famous (and correct)
prediction of a small but nonzero value of Λ [1,5,6]. In the string landscape, however,
not only Λ but also many other parameters are expected to scan. This means that
there are many additional opportunities to falsify the theory. 

It is legitimate to consider only a subset of the landscape, defined by one variable 
parameter (such as Λ), with all other parameters fixed to their observed values. If our
observations are highly atypical among the set of observations made in this restricted 
class of vacua, then the theory is ruled out. If they are typical, then the theory has 
passed a first test. We can then move on to test the theory further, by predicting a 
second parameter — for example, the primordial density contrast, Q. If this succeeds, 
we have yet another chance to falsify the theory by computing a joint probability 
distribution over both parameters: now we might find that our universe is very unlikely 
compared to one in which both parameters differ. If the theory is still not ruled out, we 
can consider a third parameter (in the present paper, the amount of spatial curvature), 
and compute yet more probability distributions. Each new probability distribution we 
compute is another chance for the theory to fail. 

In this paper, we present the first detailed computation of a trivariate probability 
distribution in the landscape. We display all single-variable and bivariate distributions 
that can be extracted from it. 

The computation of such probability distributions is complicated by a number of 
challenges. What are observers? Given a model for observers, can we actually compute 
how many observations will be made as a function of the scanning parameters? In this 
paper, we consider three models for observers, all of which require computing the rate 
at which stars form, as a function of time. We have recently developed a numerical tool 
for computing the star formation rate in vacua with different values of Λ, Q, and spatial
curvature [7]. Here, we apply our star formation model to the challenge of estimating 
the rate of observations made in this three-parameter space of vacua. As far as we 
know, this is the first time that the cosmological and astrophysical evolution of other 
vacua has been modeled at such level of detail. 

Another challenge is the measure problem. Long-lived vacua with positive cosmological
constant, which are abundant in the string landscape, lead to eternal inflation [1].
Globally, spatially infinite bubbles of each type of vacuum are produced over and over.
Everything that can happen will happen infinitely many times. To compute relative
probabilities, such as the ratio of the numbers of times two different parameter values
are observed, this divergence has to be regulated.

Recent years have seen considerable progress on the measure problem. Several 
proposals have been ruled out because they conflict violently with observation [8-18]. 
Interestingly, several measures that manage to evade the most drastic problems appear 
to be closely related. They differ at most by subexponential geometric factors [19]. 
Indeed, some of them have been shown to be precisely equivalent [20,21], despite having 
superficially a very different form. This apparent convergence is encouraging. It is all
the more important to thoroughly test extant proposals, and to discriminate between 
them, by computing probability distributions and comparing them to observation. 

Here, we consider two closely related but inequivalent members of the surviving 
group of measures: the causal diamond cut-off [22, 23] and the causal patch
cut-off [22]. A particularly interesting discovery has been that these two measures
provide a novel catastrophic boundary on parameter space, beyond which observations
are suppressed: not for dynamical reasons, like galaxy formation, but geometrically.
For example, for some values of a scanning parameter, the cut-off region may have 
exponentially small comoving volume, and for this reason alone will contain a very 
small number of observers. This geometric effect provides a stronger upper bound on 
Λ than the disruption of structure [23]. (This result is reproduced here as a special
case.) It also provides a stronger constraint on the ratio of dark matter to baryonic
matter [24]. In both cases, geometric suppression has significantly improved agreement
between theory and observation. The results presented here will reflect the effects of 
geometric suppression in a larger parameter space, and we will highlight these effects 
in the discussion of our results (Sec. 5). 

Scope and method We consider three cosmological parameters: the cosmological 
constant, Λ; the primordial density contrast, Q ≡ δρ/ρ; and the spatial curvature. We
parametrize spatial curvature by the logarithmic quantity ΔN, which can be thought of
as the number of inflationary e-foldings minus the minimum number required to explain 
the observed flatness of our universe. We scan over the three-dimensional parameter 
space 



$$ 10^{-3}\,\Lambda_0 < |\Lambda| < 10^{5}\,\Lambda_0 , \tag{1.1} $$

$$ 10^{-1}\,Q_0 < Q < 10^{2}\,Q_0 , \tag{1.2} $$

$$ -3.5 < \Delta N < \infty , \tag{1.3} $$

where Λ_0 and Q_0 are the observed values. For each combination (Λ, Q, ΔN), we
compute a history of structure formation and of star formation in the corresponding
universe. We use our own model of star formation [7], which was designed to handle
variations of these parameters over several decades. The upper limit on Q is motivated
by a change of regime found in Ref. [7]: for $Q < 10^{2}\,Q_0$, most of the star formation
happens well after recombination, where we can trust our model; for larger values, we
cannot.

We obtain single- and multivariate probability distributions by computing the expected
number of times each parameter combination (Λ, Q, ΔN) is observed in the
multiverse. We consider three different models for observers. One model assumes that 
the rate of observation tracks the rate of entropy production by stars [22,23]. The 
other two are based on the assumption that the rate of observations follows the rate at 
which stars are produced, with a delay of five or ten billion years. 

Our computation is numerical. Even an elementary treatment of the physics of 
structure formation and star formation involves a complex interplay of different
phenomena. In our own universe, several of these processes, such as structure formation,
radiative galaxy cooling, Compton cooling of galaxies, galaxy mergers, observer evolution,
and vacuum domination, happen roughly on the same time scale, a billion years
to within about one order of magnitude. (The lack of any known symmetry that could
explain this multiple coincidence is itself evidence for a multiverse [25].) The parameter
range we consider includes values in which curvature, too, comes to dominate at a
time comparable to the previously mentioned scales. Coincidences of scales preclude a 
separation into well-defined analytic regimes, necessitating a numerical computation. 

Coincidences of scales arise not just for our own universe, but persist on certain 
hypersurfaces of the parameter space we consider. The time of structure formation 
scales as $Q^{-3/2}$; the radiative cooling time as a negative power of Q; the time of
vacuum domination as $\Lambda^{-1/2}$; and the time of curvature domination as
$\exp(3\Delta N)$. So, for example, in universes
whose parameters lie near a certain hypersurface of constant $Q^3/\Lambda$, the beginning of
structure formation and its disruption by vacuum domination will not be well separated. 
In the neighborhood of such surfaces, analytical arguments are very imprecise, and 
numerical treatment is essential. 
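
As a worked illustration of the last example, the two scalings quoted above for structure formation and vacuum domination can be combined directly. Proportionality constants are suppressed, and the labels $t_{\rm struct}$ and $t_\Lambda$ are introduced here only for this sketch, so it shows the parametric dependence rather than a quantitative estimate:

```latex
t_{\rm struct} \propto Q^{-3/2}, \qquad t_\Lambda \propto |\Lambda|^{-1/2}
\quad\Longrightarrow\quad
\frac{t_{\rm struct}}{t_\Lambda} \propto \left(\frac{Q^3}{|\Lambda|}\right)^{-1/2} .
```

The ratio of the two time scales depends on Q and Λ only through $Q^3/\Lambda$, so it is constant along each hypersurface of fixed $Q^3/\Lambda$. The hypersurface on which the ratio is of order one is where the onset of structure formation and its disruption by vacuum domination overlap.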

Numerical computation is somewhat complementary to analytical arguments. Our 
code becomes unstable when certain separations of scales become too large. This limits 
the parameter range we can consider numerically. Strictly speaking, our results pertain 
only to the subset of the landscape defined by the above range of parameters. But for 
the same reason (a good separation of scales) we can often extrapolate analytically to
a larger range. Near some boundaries of our parameter range, the probability density
is negligible, and analytic arguments tell us that it will continue to decrease. In these
cases, we can safely neglect the missing part of the probability distribution. We can
also do so if the probability density is increasing towards the boundary, but there is a
catastrophic change of regime at the boundary that sharply suppresses the number of
observations in universes beyond the boundary. (For example, if we increase Q and Λ
while holding $Q^3/\Lambda$ fixed, eventually vacuum domination will occur before
recombination. Since star formation can begin only after recombination, when dark matter halos
are exponentially dilute, such universes have negligible probability of being observed.) 
However, near some boundaries, the probability distribution is increasing and there is 
no change of regime at or near the boundary. In this case, the probability distribution 
may be dominated by regions outside the parameter range we consider numerically. In 
general, we can use analytic arguments to understand its behavior in this regime. An 
example is the runaway towards small values of |Λ| that we find with the causal patch
measure. 

Results Our results are fully displayed in Sec. 4. Its six subsections correspond to the 
six combinations of measure and observer model we consider. For each model, we show 
about 30 plots corresponding to different combinations of parameters that are varied, 
held fixed, or integrated out. We discuss our results in Sec. 5, where we highlight several 
interesting features. We provide a qualitative understanding of these features, and we 
explain how probability distributions depend on the measure and the observer model. 
Most of our results do not depend strongly on how observers are modeled.^1 However,
they do depend on the measure, allowing us to discriminate between the causal diamond
and the causal patch. Let us briefly describe our most important findings.

We find that the causal diamond measure is in good agreement with observation for
all parameter combinations, independently of details of the landscape (see Figs. 2-10).
The observed values are within 2σ in all plots,^2 except if Λ is scanned over both positive
and negative values, and Q is simultaneously scanned; in this case, they lie within 2σ
or 3σ. This is in large part because the negative range, Λ < 0, is between 12 and 25
times more probable than the positive range, depending on the observer model.

The causal patch measure, on the other hand, yields a nonintegrable probability
distribution near |Λ| = 0 in the absence of a cut-off, i.e., of a smallest possible value
of |Λ|. This runaway is explained analytically in Sec. 5.3.^3 The onset of this limiting
behavior is at $|\Lambda| \sim t_c^{-2}$, where $t_c$ is the time at which curvature comes to dominate.
In particular, the runaway does not occur at all in the absence of spatial curvature.

^1 The case where both Q and Λ vary is an exception. When observers are modeled by a time delay,
larger Q does not lead to a preference for larger Λ; with entropy production, it does. In neither case,
however, do we find that our values of Q and Λ are very unlikely.

^2 In some plots they appear just outside of 2σ, but by a margin that is negligible compared to the
uncertainties in our computation both of probabilities and of the confidence contours.

^3 This result, as well as the preference for large curvature mentioned below, was anticipated in
unpublished analytical arguments by Ben Freivogel.

The strength of the runaway depends on the sign of the cosmological constant. For
Λ < 0, the probability density grows like $|\Lambda|^{-1}$ as |Λ| → 0, i.e., it grows exponentially
in the display variable $\log_{10}(|\Lambda|/\Lambda_0)$. The rapid growth is evident in several plots in
Figs. 12, 15, and 18. For Λ > 0, the probability density is independent of $\log_{10}(|\Lambda|/\Lambda_0)$
for $\Lambda \ll t_c^{-2}$. Because of our limited parameter range, this milder runaway is not
readily apparent in the relevant plots in Figs. 11, 14, and 17, but it can be predicted
analytically.
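
To make the phrase "exponentially in the display variable" explicit, one can write $x \equiv \log_{10}(|\Lambda|/\Lambda_0)$ (the symbol x is introduced here for illustration only) and take the $|\Lambda|^{-1}$ growth for Λ < 0 as given:

```latex
\frac{dp}{dx} \propto |\Lambda|^{-1}
= \Lambda_0^{-1}\, 10^{-x}
= \Lambda_0^{-1}\, e^{-x \ln 10} ,
\qquad x \equiv \log_{10}\frac{|\Lambda|}{\Lambda_0} .
```

The density therefore grows exponentially as x → −∞, and its integral over x diverges at the lower end unless the landscape supplies a smallest value of |Λ|. For Λ > 0 the density is instead constant in x at small Λ, so the integrated weight grows only linearly in |x|, i.e., logarithmically in the inverse of the smallest available Λ.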

Thus, if spatial curvature is large enough to dominate before Λ does, the causal
patch predicts a negative cosmological constant whose magnitude is the smallest among
all anthropic vacua in the landscape. Whether this runaway is a problem or a success
depends on the (unknown) size of the string landscape. It would certainly be a problem
if the landscape is so large that it contains anthropic vacua with cosmological constant
much smaller than $\Lambda_0 \sim 10^{-123}$ in magnitude. In this case the causal patch measure
would predict at high confidence level that we should find ourselves in a vacuum with
$-\Lambda_0 \ll \Lambda < 0$, and so would be ruled out. It might be a success, on the other hand,
if the observed Λ corresponds to one of the smallest values available among the finite
number of anthropic vacua in the landscape. The size of the landscape would be directly
responsible for the observed scale $10^{-123}$, with the density of its discretuum providing
an "ur-hierarchy" from which other hierarchies can be derived [22,25]. Even in this case 
the causal patch prefers negative values of the cosmological constant (and somewhat 
larger curvature than the observed upper bound), but only by a factor of order 10, not 
strongly enough to be ruled out by observation. 

At fixed values of Λ, the causal patch leads to a stronger preference for curvature
than the causal diamond. This is explained analytically in Sec. 5.3. The pressure is
particularly strong for Λ < 0, where the probability density grows very rapidly, like
$\exp(-9\Delta N)$, towards small values of ΔN. This is not a true runaway problem, because
there is a catastrophic boundary from the disruption of structure formation that will
suppress the probability for sufficiently small values of ΔN. However, after ΔN is
marginalized, this effect would contribute additional weight to vacua with negative
cosmological constant even if the runaway towards Λ = 0 were suppressed by a lower
bound $\Lambda_{\min} \lesssim \Lambda_0$ on the magnitude of the cosmological constant from the discretuum.

Thus, we find evidence that the causal patch does not yield probability distributions 
compatible with observation, unless (1) we are very close to the smallest value of |Λ|
in the discretuum ($\Lambda_{\min} \sim \Lambda_0$), or (2) the prior probability distribution differs from
what we have assumed (for example, by suppressing curvature so strongly that all
anthropic vacua can be treated as spatially flat, $t_c \gtrsim \Lambda_{\min}^{-1/2}$; this would be the case if
all inflationary models in the landscape have a very large number of e-foldings).

Another possibility is worth mentioning. The causal patch measure (with particularly
simple initial conditions) was recently shown to be equivalent [21] to the light-cone
time cut-off [19] on the multiverse. The latter is motivated [26] by analogy with the
holographic UV-IR connection of the AdS/CFT correspondence. The boundary structure
in the future of eternally inflating regions differs sharply from that in the future
of regions with Λ < 0. Since the analogy with AdS/CFT is most compelling in regions
with positive cosmological constant, it is natural to consider the possibility that
the causal patch measure may give correct relative probabilities only for observations
in such regions. This restriction would eliminate the worst of the above problems,
which pertain mainly to negative values of Λ. (It would also eliminate the divergence
for Λ = 0 [27,28].) There remains a weak (logarithmic) runaway towards Λ = 0 from
above (Λ > 0), but this would not be a problem if $-\log \Lambda_{\min} \lesssim O(100)$, a plausible
value for the string landscape [1,29].

Relation to recent work Our work can be regarded as a substantial extension and 
refinement of Ref. [23], where the probability distribution over positive values of Λ
was estimated from entropy production in the causal diamond (the "causal entropic
principle"). Here we consider a larger number and range of parameters, two different
measures, and three different models for observers. Whereas in Ref. [23] the effects
of the single parameter Λ on the star formation history were negligible in the most
important range of the probability distribution, here we are forced to compute the
entire star formation history numerically for each value of (Λ, Q, ΔN).

Other interesting extensions of Ref. [23] include Refs. [30-32]. Cline et al. [30]
compute a bivariate probability distribution over (positive) Λ and Q; and Bozek et
al. [31] compute a bivariate distribution over (positive) Λ and spatial curvature. In
principle, these portions of Refs. [30, 31] could be regarded as special cases of the
present work, with Λ > 0 and either ΔN or Q held fixed, infinities regulated by the
causal diamond measure, and observers modeled in terms of the entropy produced by 
dust heated by stars. However, our results differ because we model star formation and 
dust temperature differently. 

Both [30] and [31] employ the analytic star formation model of Hernquist and
Springel (HS) [33]. This model was designed to closely fit data and numerical simulations
of the first 13.7 Gyr of our own universe. The HS model exhibits some unphysical
features when extrapolated to later times or different values of (Λ, Q, ΔN). For example,
because it does not take into account the finiteness of the baryon supply in a
halo, the HS model predicts unlimited star formation at a constant rate after structure
formation is disrupted by a positive cosmological constant or by negative spatial curvature.
Our own star formation model [7] includes only the most important physical
effects governing star formation, and so provides only a rough (though surprisingly
good) fit of the observed star formation history. However, our model includes not just
those effects which govern star formation during the first 13.7 Gyr of our own universe,
but is designed to apply in a wide range of (Λ, Q, ΔN), and at all times after
recombination. Differences between our results and those of Refs. [30,31] can be traced
mainly to how we model star formation. A more subtle difference from Ref. [30] arises
from our treatment of the dust temperature dependence on the virial density. A trivial
difference from Ref. [31] is the choice of prior probability distribution for the parameter
ΔN. More detail is given in Sec. 2.2.

Salem [34] computes a probability distribution over positive and negative values 
of Λ, with all other parameters fixed, using the causal patch measure. Observers are
modeled as arising at a fixed time delay after the formation of galaxies that are similar
to the Milky Way in a specific sense [34]. The special case in this paper most similar
to Ref. [34] is our computation of a probability distribution over positive and negative
Λ with Q and ΔN fixed, using the causal patch measure and modeling observers by
a 10 Gyr time delay after star formation. Despite the different observer model, our
results for this case agree very well with Salem's. We find that the observed value
of Λ is nearly three standard deviations above the mean of the predicted distribution:
99.7% of observers see a smaller value than ours, and most of them see a negative value.
In fact, our observed value of Λ is outside 2σ no matter how we model observers, as
long as the causal patch is used. (The causal diamond is in better agreement with 
observation.) 

2. Making predictions in the landscape 

In this section, we will explain how we compute probabilities in the multiverse. We 
will explore two different measures, described in Sec. 2.1. In Sec. 2.2, we will discuss 
prior probabilities and cosmological selection effects, and in Sec. 2.3 we will describe 
three ways of modeling observers. In Sec. 2.4, we will explain how these ingredients are 
combined to obtain a probability distribution over the parameters (Λ, Q, ΔN).

2.1 Two choices of measure 

Before we can compute anything, we need to remove the divergences that arise in an 
eternally inflating universe. We will consider two slightly different measures:

Figure 1: The causal patch (shaded triangle) is the past of the future endpoint of a geodesic
(vertical line) in the multiverse. The causal diamond (dark shaded) is the intersection of
the causal patch with the future of the point B, where the geodesic intersects a surface of
reheating (dashed).



2.1.1 Causal patch cut-off 

The causal patch is the past of a point on the future boundary of the spacetime (Fig. 1). 
Consider a pocket universe described by the Friedmann-Robertson-Walker metric

$$ ds^2 = -dt^2 + a^2(t) \left[ d\chi^2 + f^2(\chi)\, d\Omega^2 \right] . \tag{2.1} $$

We consider only open universes (which include flat universes as a special case), so
$f(\chi) = \sinh\chi$. The causal patch is the set of points with $\chi < \chi_{\rm patch}$, where

$$ \chi_{\rm patch}(t) = \int_t^{t_{\max}} \frac{dt'}{a(t')} , \tag{2.2} $$

and $t_{\max}$ is the time of the crunch. For a long-lived de Sitter vacuum, we can take
$t_{\max} \to \infty$.

In any long-lived de Sitter vacuum (Λ > 0), the patch coincides with the interior
of the event horizon, because a late-time decay into a terminal vacuum (with Λ < 0)
does not affect the size of the event horizon at early times. In vacua with Λ < 0, the
causal patch is the past of a point on the future singularity (the "big crunch"). We will
not consider Λ = 0 vacua in this paper. The causal patch has divergent four-volume in
such vacua [27, 28].

The causal patch cut-off was motivated by the resolution of the quantum xeroxing
paradox in black holes [35,36]. Recently, the measure was shown to be exactly equivalent
to the light-cone time cut-off [19,21], which was motivated by an analogy with
the AdS/CFT correspondence [19,26]. The analogy is most compelling in eternally
inflating regions of the multiverse ("eternal domains"). From this viewpoint, it is
conceivable that the regime of validity of the causal patch cut-off is limited to vacua with
Λ > 0. Our results will offer some phenomenological evidence for this possibility, in
that we will find that the measure is least successful in vacua with Λ < 0.
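
The comoving radius of the causal patch, i.e. the integral of dt'/a(t') from t to t_max, can be evaluated numerically for any given scale factor. The following is a minimal sketch, not the code used for the results of this paper: the function name `chi_patch`, the pure de Sitter toy scale factor a(t) = exp(Ht) with H = 1, and the large finite stand-in for an infinite t_max are all assumptions made for illustration.

```python
import math

def chi_patch(t, scale_factor, t_max, steps=20000):
    """Comoving patch radius at time t: integral of dt'/a(t') from t to t_max.

    Evaluated with the composite trapezoidal rule on a uniform grid.
    """
    if t_max <= t:
        return 0.0
    h = (t_max - t) / steps
    total = 0.5 * (1.0 / scale_factor(t) + 1.0 / scale_factor(t_max))
    for i in range(1, steps):
        total += 1.0 / scale_factor(t + i * h)
    return total * h

# Toy example (assumption): pure de Sitter expansion a(t) = exp(H t), H = 1.
H = 1.0

def a_de_sitter(t):
    return math.exp(H * t)

# For a long-lived de Sitter vacuum t_max is effectively infinite; a large
# finite value is used here as a numerical stand-in.
T_LARGE = 40.0

for t in (0.0, 1.0, 5.0):
    numeric = chi_patch(t, a_de_sitter, T_LARGE)
    exact = math.exp(-H * t) / H  # analytic comoving event-horizon radius
    print(t, numeric, exact)
```

In this toy case the comoving radius falls off as exp(-Ht), which illustrates the geometric suppression discussed in the Introduction: at late times the cut-off region contains an exponentially small comoving volume.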



2.1.2 Causal diamond cut-off 

Consider a geodesic in the multiverse. The causal diamond is the intersection of the 
causal past of the future endpoint of the geodesic (the causal patch) with the causal 
future of some earlier point B. We will follow Ref. [23], where B was taken to be the 
point where the geodesic intersects the surface of reheating (Fig. 1). Thus, the causal 
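diamond is the set of points with $\chi < \chi_{\rm dia}$, where

$$ \chi_{\rm dia}(t) = \min\{\chi_{\rm patch}(t), \eta(t)\} = \frac{\eta_{\max}}{2} - \left| \frac{\eta_{\max}}{2} - \eta(t) \right| , \tag{2.3} $$

where

$$ \eta(t) = \int_{t_{\rm rh}}^{t} \frac{dt'}{a(t')} , \tag{2.4} $$

with $t_{\rm rh}$ the time of reheating, and $\eta_{\max} = \eta(t_{\max})$.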

Because of the additional restriction to the future of B, the diamond cannot be
larger than the patch. With our choice of B, the diamond will be smaller than the
patch approximately until Λ-domination, and it will coincide with the patch after
Λ-domination.
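
The relation between the two cut-offs can be checked with a few lines of code. This is a sketch under the assumption that η(t) denotes the conformal time elapsed since reheating, so that the patch radius is η_max − η and the future light cone of B has radius η. The function names and the value of `eta_max` are hypothetical and chosen only for illustration.

```python
def chi_patch_from_eta(eta, eta_max):
    """Patch radius expressed through conformal time: eta_max - eta."""
    return eta_max - eta

def chi_diamond(eta, eta_max):
    """Diamond radius: the smaller of the patch radius and the light cone of B."""
    return min(chi_patch_from_eta(eta, eta_max), eta)

def chi_diamond_closed_form(eta, eta_max):
    """Equivalent closed form: eta_max/2 - |eta_max/2 - eta|."""
    return 0.5 * eta_max - abs(0.5 * eta_max - eta)

eta_max = 3.0  # arbitrary illustrative value (assumption)
etas = [eta_max * k / 12 for k in range(13)]
for eta in etas:
    print(eta, chi_patch_from_eta(eta, eta_max), chi_diamond(eta, eta_max))
```

The diamond grows until η = η_max/2, where it reaches its largest comoving radius, and coincides with the patch from then on. It never exceeds the patch.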

Our choice of B is motivated by the absence of matter prior to reheating. However, 
the concept of a reheating surface is not completely sharp. Nevertheless, the causal 
diamond may be an approximation to a more generally defined cut-off; a candidate will 
be discussed in future work. (In Ref. [22], the point B was taken to be the starting point
of the geodesic on some initial spacelike hypersurface. Then most pocket universes will 
lie entirely in the future of B. Except for very unnatural initial conditions, the region 
excluded from the diamond but present in the patch will be an empty de Sitter region 
with large cosmological constant. Thus, with this choice, the causal diamond gives the 
same probabilities as the causal patch.) 






2.2 Prior distribution and cosmological selection 

The probability distribution over an observable parameter x can be operationally
defined as the relative abundance of the various outcomes of all measurements of this
parameter in the whole universe.^4 It will be useful for us to think of this probability
distribution as a convolution of the following three distributions: 

• Prior distribution. The relative abundance of different values of the parameter x
among vacua in the theory landscape.

• Cosmological selection effects. The relative abundance of the different vacua in
the universe will differ from the prior distribution because of selection effects of
cosmological dynamics and/or initial conditions.

• Anthropic selection effects. Whether, and how frequently, some value of x is
observed may depend on x.

Once a measure has been chosen (see the previous subsection), all three distributions 
listed above can be computed. Let us discuss the first two in turn; we will devote a 
separate subsection to the third. 

Prior distribution Because the cosmological constant is effectively a random variable
and Λ = 0 is not a special point, the prior distribution of Λ can be approximated
as flat in the anthropically relevant regime (Λ ≪ 1):

$$ \frac{dp}{d\Lambda} \propto 1 , \tag{2.5} $$

which translates into a prior proportional to Λ for $\log_{10}\Lambda$, our choice of display
parameter.
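
The translation from a flat prior in Λ to a prior proportional to Λ in the display variable is a change of variables, which can be written out as follows:

```latex
\frac{dp}{d\log_{10}\Lambda}
= \frac{d\Lambda}{d\log_{10}\Lambda}\,\frac{dp}{d\Lambda}
= (\ln 10)\,\Lambda\,\frac{dp}{d\Lambda}
\propto \Lambda .
```

The constant factor ln 10 drops out of any normalized distribution.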

We know much less about the prior distributions of spatial curvature, ΔN, and
the primordial density contrast, Q, in the string landscape. There are certain prior
distributions which seem implausible, such as a strong preference for large hierarchies
(e.g., for small $\log_{10} Q$ or large ΔN), but this still leaves considerable uncertainty. For
definiteness, Q will be assumed to have a prior which is flat in $\log_{10} Q$, which we view
as the most optimistic choice among reasonable alternatives:

$$ \frac{dp}{d\log_{10} Q} \propto 1 . \tag{2.6} $$

For curvature, Ref. [40] estimated

$$ \frac{dp}{dN} \propto \frac{1}{N^4} . \tag{2.7} $$

^4 See, e.g., Refs. [13,37-39] for discussions of this claim.

We shall use this prior distribution together with the assumption that ΔN = 0 corresponds
to N = 60.
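
For orientation, the curvature prior can be expressed in terms of ΔN and normalized. The sketch below assumes a prior density proportional to N^-4 with N = 60 + ΔN, and it assumes that the prior is normalized on the scanned range ΔN > -3.5 of Eq. (1.3). That normalization convention and the function names are choices made here for illustration.

```python
N_FLAT = 60.0        # e-foldings corresponding to Delta N = 0 (from the text)
DELTA_N_MIN = -3.5   # lower edge of the scanned range, Eq. (1.3)

def prior_density(delta_n):
    """Normalized prior dp/d(Delta N) on (DELTA_N_MIN, infinity), proportional to N**-4."""
    if delta_n < DELTA_N_MIN:
        return 0.0
    n_min = N_FLAT + DELTA_N_MIN
    # The integral of N**-4 from n_min to infinity is n_min**-3 / 3.
    return 3.0 * n_min**3 / (N_FLAT + delta_n) ** 4

def prior_tail(delta_n):
    """Prior probability of a value larger than delta_n."""
    if delta_n <= DELTA_N_MIN:
        return 1.0
    n_min = N_FLAT + DELTA_N_MIN
    return (n_min / (N_FLAT + delta_n)) ** 3

print(prior_density(0.0), prior_tail(0.0))
```

Because N varies by only a few percent across the anthropically relevant range of ΔN, this prior is nearly flat there, so the structure in the resulting distributions comes mainly from the measure and the observer model rather than from the prior.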

Despite the large uncertainties about the priors, our results will be robust in the 
following sense: In cases where we find a tension between the prediction of a measure 
and observation, this tension could only be removed by choosing a highly implausible 
prior on Q or ΔN.

(No) cosmological selection Relative to the very high energy scales that determine 
the decay channels and decay rates of metastable landscape vacua, the length of inflation
and the mechanism for generating density perturbations can plausibly be assumed
to arise at relatively low energies, and thus, to be uncorrelated with the production rate
of various vacua. This also holds for the cosmological constant, since we are interested
only in an anthropic range of values, Λ ≪ 1. These values can only be measured at
very low energy density, and so cannot be correlated with the nucleation rate of vacua.
Therefore, we will ignore cosmological selection effects.^5

2.3 Three ways of modeling observers 

Finally, we must compute the expected number of instances of observing different values 
of the parameters (Λ, Q, ΔN) in the cut-off region. In general, these values will be
correlated with the presence and number of observers, so we must compute the number
of observers as a function of (Λ, Q, ΔN). In principle, there is no reason why such a
computation could not be performed in a sufficiently powerful theory, by a sufficiently
able theorist. In practice, we struggle to define "observer" or "observation" in complete
generality. In this paper, we will consider three models for observers. In the first two,
we focus on observers "like us", which arise near a star, a certain number of years (5
Gyr in the first model, 10 Gyr in the second) after the formation of stars. The third
model uses entropy production as a more general proxy for observers [22,23].^6 We will
now describe these models in more detail. 

^5 We assume, however, that there is no "staggering problem" [41,42], in which cosmological selection
effects lead to such unequal probabilities for different vacua as to effectively eliminate most of the 
landscape and render it unable to solve the cosmological constant problem. The presence of this 
problem depends on details of the landscape, and in the case of the two local measures considered 
here, on initial conditions. It is absent in plausible toy models [43,44]. 

^6 In combination with the causal diamond cut-off this has been called the "Causal Entropic
Principle". However, we should stress that the question of modeling observers is, at least naively,
orthogonal to the measure problem. Entropy production could be used as an observer proxy
essentially in any measure.

2.3.1 Observers = stars + time delay of 5 or 10 Gyr

We are at liberty to restrict our attention to any class of observers that includes us, for 
example, observers that emerge near stars. This probably ignores some observers in 
the landscape. There may be regions without stars that nevertheless contain observers 
powered by some other source of free energy. However, we can still ask whether our 
observations are typical of observers in the class we have defined; a theory in which 
they turn out to be highly atypical can be ruled out. 

We need to know both the spatial and temporal location of observations in order to
compute what they will observe. Concretely, let us assume that each star, on average,
gives rise to some number of observers after a fixed "evolutionary" delay time $t_{\rm delay}$.
Then the number of observations made at the time t, per unit comoving volume and
unit time, is

$$ \frac{d^2 n_{\rm obs}}{dV_c\, dt}(t) \propto \dot\rho_\star(t - t_{\rm delay}) , \tag{2.8} $$

where $\dot\rho_\star(t) \equiv d^2 m_\star / dV_c\, dt$ is the star formation rate, i.e., the amount of stellar mass
produced per unit time and per unit matter mass. Because we assume a fixed initial
mass function for stars, this equation holds independently of how the number of
observers may depend on the mass of the star, so we will not need to make any particular
assumption about this distribution.

In this paper, we explore two choices: $t_{\rm delay}$ = 5 Gyr, and $t_{\rm delay}$ = 10 Gyr. The
first choice corresponds to the evolutionary timescale of life on Earth. It defines a class
of observers that are like us in the sense that they exist at equal time delay after the
birth of their respective star. In this case, stars with lifetimes less than 5 Gyr do not
contribute to observations and should be excluded in Eq. (2.8). However, by the remark
at the end of the previous paragraph, this only affects the constant of proportionality
in Eq. (2.8), but not the normalized probability distributions.

The second choice defines a slightly different class of observers, which are like us 
in the sense that they exist 10 Gyr after most stars in the universe are produced. (In 
our universe, the peak of the star formation rate was about 10 Gyr ago.) 
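
As an illustration of the time-delay prescription in Eq. (2.8), the following minimal Python sketch evaluates a star formation history at the earlier time t - t_delay. The function names and the toy star formation history are hypothetical and chosen only for illustration; they are not the star formation model of Ref. [7].

```python
import math

def toy_sfr(t_gyr):
    """Hypothetical star formation history in arbitrary units (illustration only)."""
    return t_gyr * math.exp(-t_gyr / 3.0) if t_gyr > 0.0 else 0.0

def observation_rate(t_gyr, sfr, t_delay_gyr):
    """Unnormalized rate of observations at time t: the SFR shifted by the delay.

    No observations are counted before the delay time has elapsed.
    """
    if t_gyr < t_delay_gyr:
        return 0.0
    return sfr(t_gyr - t_delay_gyr)

# The two observer classes considered in the text, evaluated at an arbitrary time
rate_5 = observation_rate(13.7, toy_sfr, 5.0)
rate_10 = observation_rate(13.7, toy_sfr, 10.0)
```

Only the shape of the shifted rate matters here, since overall constants drop out of normalized probabilities.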

2.3.2 Observers = entropy production 

The second law of thermodynamics guarantees that the spontaneous formation of an 
ordered subsystem (like a frozen pond, or a galaxy) will be compensated by increased 
entropy in the remaining system. In practice, this increase tends to overcompensate 
vastly, and the overall entropy increases. This motivated one of us to propose [22] that 
the emergence of complex structures such as observers is correlated with the production 
of entropy. The simplest ansatz is that the rate of observation is proportional (on
average) to the rate of entropy production:

$$ \frac{d^2 n_{\rm obs}}{dV_c\, dt}(t) \propto \frac{d^2 S}{dV_c\, dt}(t) . \tag{2.9} $$



In Ref. [23], it was shown that most of the entropy produced inside the causal 
diamond in our universe comes from dust heated by stars. In the absence of complex 
molecules, let alone stars and galaxies, the entropy production would be much lower. 
The simple criterion of entropy production thus turns out to capture several conditions 
often assumed explicitly to be necessary for life. Moreover, it succeeds very well in 
postdicting our rather unusual location: If life is correlated with entropy production, 
then most life forms will find themselves when and where most entropy production 
takes place: near stars, during the era while stars are burning. Indeed, the probability
distribution over positive values of Λ computed from the causal entropic principle
(weighting by entropy production in the causal diamond) proved to be in excellent
agreement with observation [23].

Here we refine and extend the validity of that prescription. We will use the star 
formation rates calculated according to Ref. [7] in order to properly account for the 
changes in star formation as cosmological parameters are varied. In addition, we will 
account for the dependence of the dust temperature on the virial density. 

In contrast to the causal diamond, overall entropy production in the causal patch 
is dominated by the entropy produced at reheating. This is evidently not a good 
proxy for observers. In the context of the causal patch cut-off, we will model observers 
specifically in terms of the entropy produced by dust heated by stars (or, as above, by
a time delay).

2.4 Summary 

The rate of observation, per unit comoving volume and unit time, in a universe with
parameters (Λ, Q, ΔN) is given by

$$ \frac{d^2 n_{\rm obs}}{dV_c\, dt}(t; \Lambda, Q, \Delta N) \propto
\begin{cases}
\dfrac{d^2 S}{dV_c\, dt}(t; \Lambda, Q, \Delta N) & \text{(entropy production)} \\
\dot\rho_\star(t - 5\ {\rm Gyr}; \Lambda, Q, \Delta N) & \text{(star formation plus 5 Gyr)} \\
\dot\rho_\star(t - 10\ {\rm Gyr}; \Lambda, Q, \Delta N) & \text{(star formation plus 10 Gyr)}
\end{cases} \tag{2.10} $$

depending on which model for observers we use; see Sec. 2.3. In the second (third)
case, we set the rate of observation to zero for t < 5 Gyr (t < 10 Gyr).

The total number of observations in a universe with parameters (Λ, Q, ΔN) is
given by integrating the above rate over the time and comoving volume contained in
the causal patch or causal diamond:

$$ n_{\rm obs}(\Lambda, Q, \Delta N) = \int_0^\infty dt\, V_c(t)\, \frac{d^2 n_{\rm obs}}{dV_c\, dt}(t; \Lambda, Q, \Delta N) , \tag{2.11} $$

where the comoving volume at time t is given by

$$ V_c(t) =
\begin{cases}
\int_0^{\chi_{\rm patch}(t)} d\chi\, 4\pi \sinh^2\chi & \text{(causal patch measure)} \\
\int_0^{\chi_{\rm diamond}(t)} d\chi\, 4\pi \sinh^2\chi & \text{(causal diamond measure)}
\end{cases} \tag{2.12} $$

and $\chi_{\rm patch}$ and $\chi_{\rm diamond}$ are given in Eqs. (2.2) and (2.3).
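
The structure of Eqs. (2.11) and (2.12) can be sketched numerically as follows. This is a minimal illustration, not the computation used for the results below: the comoving radius is assumed to be measured in units of the curvature radius, and the toy functions for χ(t) and for the observation rate are hypothetical stand-ins for Eqs. (2.2), (2.3), and (2.10).

```python
import math

def comoving_volume(chi):
    """Closed form of the integral of 4*pi*sinh(x)**2 from 0 to chi."""
    return math.pi * (math.sinh(2.0 * chi) - 2.0 * chi)

def n_obs(rate, chi_of_t, t_max, steps=20000):
    """Trapezoid-rule estimate of the time integral of V_c(t) * rate(t) on [0, t_max]."""
    dt = t_max / steps
    total = 0.0
    for i in range(steps + 1):
        t = i * dt
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * comoving_volume(chi_of_t(t)) * rate(t)
    return total * dt

def toy_chi(t):
    """Hypothetical shrinking comoving radius of the cut-off region."""
    return 1.0 / (1.0 + t)

def toy_rate(t):
    """Hypothetical unnormalized observation rate."""
    return t * math.exp(-t)

estimate = n_obs(toy_rate, toy_chi, t_max=40.0)
```

For small χ the closed form reduces to the flat-space volume 4πχ³/3, which is the regime in which curvature does not affect the geometry of the patch or diamond.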

The probability distribution over (Λ, Q, ΔN) is obtained by multiplying the prior
probability for a universe with (Λ, Q, ΔN), discussed in Sec. 2.2, by the number of
observations made in such a universe:

$$ \frac{d^3 p}{d\log\Lambda\; d\log_{10} Q\; d(\Delta N)} = \frac{\Lambda}{(60 + \Delta N)^4}\, n_{\rm obs}(\Lambda, Q, \Delta N) . \tag{2.13} $$

With three choices of observer model and two choices of measure, we thus consider a
total of six different models for computing probabilities in the multiverse.



3. Star formation and entropy production in the multiverse 

In this section we describe in detail how we compute the quantities appearing in 
Eq. (2.10). In Sec. 3.1 we review our star formation model [7]. In Sec. 3.2, we es- 
timate the rate of entropy production by the dust heated by stars, following [23,45]. 

3.1 Star formation 

To compute the rate of star formation per unit time and unit comoving volume in a
universe with parameters (Λ, Q, ΔN), we use the model we developed in Ref. [7]. The
following summary will skip many details, and the reader is encouraged to consult
Ref. [7] for a more thorough discussion.
There are three steps to star formation: (1) density perturbations grow and collapse to 
form dark matter halos; (2) baryons trapped in the halo cool and condense; (3) stars 
form from the cooled gas. 

Cosmological perturbations can be specified by a time-dependent power spectrum,
$\mathcal{P}$, which is a function of the wavenumber of the perturbation, k. The r.m.s. fluctuation
amplitude, σ, within a sphere of radius R, is defined by smoothing the power spectrum
with respect to an appropriate window function:



The radius R can be exchanged for the mass M of the perturbation, using
$M = 4\pi \rho_m R^3 / 3$. Generally σ factorizes as

$$ \sigma(M, t) = Q\, s(M)\, G(t) . \tag{3.3} $$

Q sets the overall scale of density perturbations, and is one of the parameters we vary.
The scale dependence s(M) is held fixed; we use the fitting formula provided in Ref. [46]:

$$ s(M) = \left[ \left( 9.1\, \mu^{-2/3} \right)^{-0.27} + \left( 50.5 \log_{10}\!\left( 834 + \mu^{-1/3} \right) - 92 \right)^{-0.27} \right]^{-1/0.27} \tag{3.4} $$

with $\mu = M / M_{\rm eq}$, where

$$ M_{\rm eq} = 1.18 \times 10^{17}\, m_\odot \tag{3.5} $$

is roughly the mass contained inside the horizon at matter-radiation equality. The
linear growth function G(t) satisfies

$$ \frac{d^2 G}{dt^2} + 2H \frac{dG}{dt} = 4\pi G_N \rho_m G \tag{3.6} $$

with the initial conditions G = 5/2 and $\dot G = 3H/2$ at $t = t_{\rm eq}$. For each value of Λ and
ΔN, we numerically compute the scale factor a(t) and, from Eq. (3.6), G(t).
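
A numerical integration of the growth equation, Eq. (3.6), can be sketched as below. This is a minimal illustration under stated assumptions: it uses a fixed-step fourth-order Runge-Kutta scheme and a toy matter-dominated (Einstein-de Sitter) background with H = 2/(3t) and 4πG_N ρ_m = (3/2)H², rather than the full background with Λ and curvature. In that toy background the growing mode is G ∝ t^(2/3), which gives a simple check. All function names are hypothetical.

```python
def integrate_growth(t0, t1, g0, gdot0, hubble, source, steps=2000):
    """RK4 integration of G'' + 2 H(t) G' = source(t) G from t0 to t1; returns G(t1)."""
    h = (t1 - t0) / steps

    def deriv(t, g, v):
        # First-order system: G' = v, v' = source * G - 2 H v
        return v, source(t) * g - 2.0 * hubble(t) * v

    t, g, v = t0, g0, gdot0
    for _ in range(steps):
        k1g, k1v = deriv(t, g, v)
        k2g, k2v = deriv(t + h / 2, g + h / 2 * k1g, v + h / 2 * k1v)
        k3g, k3v = deriv(t + h / 2, g + h / 2 * k2g, v + h / 2 * k2v)
        k4g, k4v = deriv(t + h, g + h * k3g, v + h * k3v)
        g += h / 6 * (k1g + 2 * k2g + 2 * k3g + k4g)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        t += h
    return g

def eds_hubble(t):
    """Hubble rate in the toy matter-dominated background."""
    return 2.0 / (3.0 * t)

def eds_source(t):
    """4 pi G_N rho_m = (3/2) H^2 in the toy matter-dominated background."""
    return 2.0 / (3.0 * t * t)

# Start on the pure growing mode G = t**(2/3) at t = 1 and evolve to t = 8
g_end = integrate_growth(1.0, 8.0, 1.0, 2.0 / 3.0, eds_hubble, eds_source)
```

Replacing `eds_hubble` and `eds_source` by functions derived from a numerically computed scale factor, and starting from the initial conditions quoted above, would give the growth function for general (Λ, ΔN).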

Density perturbations grow and collapse to form structure. The Press-Schechter
function, F, gives the total fraction of mass collapsed into structures of mass < M [47]:

$$ F(< M, t) = {\rm Erf}\left( \frac{1.68}{\sqrt{2}\, \sigma(M, t)} \right) . \tag{3.7} $$

We can compute the mass density of a collapsed halo (called the virial density, $\rho_{\rm vir}$)
using the spherical top-hat collapse model. The virial density does not depend on the
mass of the object, but only on the time of collapse.
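
The chain from the fitting formula for s(M) to the Press-Schechter function can be sketched in a few lines. This is an illustration under assumptions: the exponent -0.27 and the collapse threshold 1.68 are taken as read from Eqs. (3.4) and (3.7), the mass enters only through μ = M/M_eq, and the product Q·G(t) is passed in as a single hypothetical input rather than computed.

```python
import math

BETA = -0.27  # exponent in the fitting formula for s(M), as read from Eq. (3.4)

def s_of_mu(mu):
    """Scale dependence of the fluctuation amplitude, with mu = M / M_eq."""
    a = 9.1 * mu ** (-2.0 / 3.0)
    b = 50.5 * math.log10(834.0 + mu ** (-1.0 / 3.0)) - 92.0
    return (a ** BETA + b ** BETA) ** (1.0 / BETA)

def sigma(mu, q_times_growth):
    """r.m.s. amplitude sigma = Q * s(M) * G(t); Q * G(t) is supplied by the caller."""
    return q_times_growth * s_of_mu(mu)

def collapsed_fraction(mu, q_times_growth, delta_c=1.68):
    """Press-Schechter fraction of mass in structures of mass below M."""
    return math.erf(delta_c / (math.sqrt(2.0) * sigma(mu, q_times_growth)))

# Illustrative evaluation at a hypothetical value of Q * G(t)
f_low = collapsed_fraction(1e-6, 0.5)
f_high = collapsed_fraction(1.0, 0.5)
```

Because s decreases with mass, the function F(< M, t) increases with M at fixed time, as a cumulative distribution should.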

After collapse, the baryonic component of a halo must cool and undergo further
condensation before stars can form. We require that this cooling process happen
sufficiently quickly. The most efficient cooling mechanisms require ionized gas, so only those
halos with a virial temperature above 10^4 K can cool further. This translates into a
time-dependent lower mass limit for star-forming halos. Also, we require that halos
cool on a timescale faster than their own gravitational timescale, $t_{\rm grav} = (G_N \rho_{\rm vir})^{-1/2}$.
This is motivated by observations indicating that cooling-limited galaxy formation is
ineffective, providing a time-dependent upper mass limit on star-forming halos.

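A halo in the allowed mass range is assigned an individual star formation rate
based on its mass and time of virialization:

$$ \frac{dm_\star^{\rm single}}{dt}(M, t_{\rm vir}) = \frac{1}{6} \sqrt{\frac{32}{3\pi}}\, \frac{M}{t_{\rm grav}(t_{\rm vir})} . \tag{3.8} $$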

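We use the extended Press-Schechter formalism [48] to sum over the formation times
of all halos of mass M in existence at time t:

$$ \frac{dm_\star^{\rm avg}}{dt}(M, t) = \frac{1}{6} \sqrt{\frac{32}{3\pi}} \int_{t_{\min}}^{t_{\max}} \frac{M}{t_{\rm grav}(t_{\rm vir})}\, \frac{\partial P}{\partial t_{\rm vir}}(t_{\rm vir}, M, t)\; dt_{\rm vir} . \tag{3.9} $$

The function P is the probability that a halo of mass M at time t virialized before
$t_{\rm vir}$, and is derived in Ref. [48]:

$$ P(< t_{\rm vir}, M, t) = \int_{M/2}^{M} \frac{M}{M_1}\, \frac{d\beta}{dM_1}(M_1, t_{\rm vir}, M, t)\; dM_1 , \tag{3.10} $$

where

$$ \beta(M_1, t_1, M_2, t_2) = {\rm Erfc}\left( \frac{1.68}{Q \sqrt{2 \left( s(M_1)^2 - s(M_2)^2 \right)}} \left( \frac{1}{G(t_1)} - \frac{1}{G(t_2)} \right) \right) . \tag{3.11} $$

$t_{\min}$ and $t_{\max}$ are specific functions of M and t designed to restrict the range of
integration to only those halos which are capable of cooling and have not yet used up all
of their cold gas supply [7].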

Finally, the star formation rate itself is given by summing over all values of halo 
mass, weighted by the Press-Schechter distribution function: 



(3.12) 



3.2 Entropy production 



As explained in Sec. 2.3, we model the rate of observation either by the rate of star
formation plus a time delay, or by the rate of entropy production by stars. A time
delay is trivial to implement: we simply shift the star formation rate by 5 or 10 Gyr.
Here we will discuss how to estimate the rate of entropy production per unit comoving
volume and unit time. The entropy production rate at the time t is given by the total
luminosity of the stars shining at that time, divided by the effective temperature at
which this power is ultimately dissipated. A significant fraction of starlight is absorbed
by interstellar dust grains and re-emitted in the infrared. This process converts one
optical photon into many infrared photons, so it dominates entropy production [23].
Hence, the appropriate temperature is the interstellar dust temperature.

We will follow Ref. [23] for computing the total luminosity at time t from the star
formation rate at earlier times. We will follow Ref. [45] for estimating the temperature
of dust at time t in a galaxy that formed at time $t_f$. This doubly time-dependent
treatment of the dust temperature is a refinement over Ref. [23], where the temperature
was held fixed.

We take the luminosity of an individual star to be related to its mass by $L_\star \propto m^{3.5}$.
The mass distribution of newly formed stars is assumed to be described by the Salpeter
initial mass function, independently of time and of the parameters (Λ, Q, ΔN):



No stars form ($\xi_{\rm IMF}(m) = 0$) outside the range $0.08\, m_\odot < m < 100\, m_\odot$. Here a and b
are constants chosen so that the function is continuous and integrates to one over the
allowed mass range.

The lifetime of a star is also controlled by its mass; smaller stars live longer. It is
convenient to work with the inverted relation [23]:



where $m_{\max}(\Delta t)$ is the mass of the largest survivors in an ensemble of stars created a
time Δt ago. Setting $m_{\max} = 0.08$ corresponds to $\Delta t = 8^{-5/2} \times 10^{6}$ Gyr ≈ 5500 Gyr,
the lifetime of the longest-lived stars.
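
The quoted lifetime of the longest-lived stars can be checked with a short sketch. The explicit form of the inverted relation is not reproduced above, so the code below assumes a lifetime of 10 Gyr × (m/m_⊙)^(-5/2), which is consistent with the luminosity scaling m^3.5 and with the value of about 5500 Gyr at m = 0.08 m_⊙. The cap at 100 m_⊙ reflects the upper end of the allowed mass range. Both function names are hypothetical.

```python
def lifetime_gyr(m):
    """Assumed stellar lifetime in Gyr for a star of mass m (in solar masses)."""
    return 10.0 * m ** (-2.5)

def m_max(delta_t_gyr, m_upper=100.0):
    """Mass (solar masses) of the largest survivors a time delta_t after formation."""
    if delta_t_gyr <= 0.0:
        return m_upper
    return min(m_upper, (10.0 / delta_t_gyr) ** 0.4)

longest = lifetime_gyr(0.08)   # lifetime of the lowest-mass stars, in Gyr
```

Inverting the assumed lifetime gives back the mass, so `m_max(lifetime_gyr(m))` returns m for any m below the cap.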

Now consider an ensemble of stars of total mass $dm_\star$ that formed at the time $t_f$.
Their combined luminosity at the time $t = t_f + \Delta t$ is independent of $t_f$ and is given by



The mass and luminosity of the sun, $m_\odot$ and $L_\odot$, and the average initial mass, $\langle m \rangle$,
are constant and drop out in all normalized probabilities. We will continue to display
them for clarity.
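
The luminosity of a stellar ensemble can be sketched by integrating the initial mass function against the mass-luminosity relation up to the largest surviving mass. The explicit form of the initial mass function is not reproduced above, so the code assumes a broken power law with slope -2.35 above 0.5 m_⊙ and -1.5 below it; these slopes and the break mass are assumptions made for illustration. The constants a and b are fixed by continuity and normalization, as described in the text.

```python
M_LO, M_BREAK, M_HI = 0.08, 0.5, 100.0   # solar masses; break mass is an assumption
P_LOW, P_HIGH = -1.5, -2.35              # assumed IMF slopes below / above the break
LUM_EXP = 3.5                            # luminosity ~ m**3.5 in solar units

def power_integral(p, lo, hi):
    """Integral of m**p dm from lo to hi, valid for p != -1."""
    return (hi ** (p + 1.0) - lo ** (p + 1.0)) / (p + 1.0)

# Continuity at the break fixes b/a; normalization to one then fixes a.
B_OVER_A = M_BREAK ** (P_HIGH - P_LOW)
A = 1.0 / (B_OVER_A * power_integral(P_LOW, M_LO, M_BREAK)
           + power_integral(P_HIGH, M_BREAK, M_HI))
B = A * B_OVER_A

def weighted_integral(extra_power, m_cut):
    """Integral of IMF(m) * m**extra_power from M_LO up to min(m_cut, M_HI)."""
    m_cut = min(m_cut, M_HI)
    if m_cut <= M_LO:
        return 0.0
    total = B * power_integral(P_LOW + extra_power, M_LO, min(m_cut, M_BREAK))
    if m_cut > M_BREAK:
        total += A * power_integral(P_HIGH + extra_power, M_BREAK, m_cut)
    return total

MEAN_MASS = weighted_integral(1.0, M_HI)   # average initial mass in solar masses

def luminosity_per_stellar_mass(m_cut):
    """Solar luminosities per solar mass formed, from survivors up to mass m_cut."""
    return weighted_integral(LUM_EXP, m_cut) / MEAN_MASS
```

Because the luminosity integral is dominated by the most massive survivors, the result drops steeply as the ensemble ages and the cut-off mass decreases.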

Next, we turn to estimating the temperature of interstellar dust, at which this
luminosity is ultimately dissipated. The dust temperature will depend on the mass
density of the host galaxy (higher density means higher temperature), and on the
CMB temperature. The CMB temperature is borderline negligible in our universe.
But in a large portion of our parameter space (specifically, for Q > Q_0), significant
star formation occurs earlier than in our universe. Then the CMB temperature can be
larger than the temperature the dust would reach just from stellar heating, and so it
can effectively control the dust temperature. This is an important effect mitigating the
preference for larger values of Q [45].

Ref. [49] [see Eq. 170 therein] models how the temperature of the interstellar dust
scales with the distance to a star:^

$$ T(t_{\rm vir}, t)^6 \propto \left( \frac{R_\star}{R(t_{\rm vir})} \right)^2 T_\star^5 + T_{\rm CMB}(t)^5 , \tag{3.16} $$

where we have included the last term to account for the heating of dust by the CMB.
Here, R is a typical distance between stars, and $T_\star$ is a typical stellar temperature (we
use 6000 K). We are explicitly dropping an overall dimensionful factor because we are
only interested in normalized probabilities. One expects the interstellar distance to
scale inversely with the density of the host galaxy, which in turn is proportional to the
virial density of the halo:

$$ R^3 \propto \rho_{\rm vir}^{-1} . \tag{3.17} $$

We normalize to the value $(R_\star / R)^2 = 3.5 \times 10^{-14}$ for our galaxy [49], which we assume
formed with a virial density typical of halos forming at $t_{\rm vir}$ = 3.7 Gyr. Then Eq. (3.17)
determines the relevant R for other galaxies that form at different times in our universe
or in others. Note that the virial density is set by the time $t_{\rm vir}$ of the last major merger,
whereas the CMB temperature appearing in Eq. (3.16) must be evaluated at the time
of emission. In our model, stars do not form for a long time after virialization,

$$ \frac{t_f - t_{\rm vir}}{t_{\rm vir}} \lesssim 1 . \tag{3.18} $$

Thus we can approximate $t_{\rm vir} \approx t_f$ for the purposes of estimating the relevant dust
temperature, leaving us with one fewer time variable to keep track of.
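
The interplay between stellar heating and the CMB in setting the dust temperature can be illustrated with a short sketch. It assumes the scaling T^6 ∝ (R_⋆/R)² T_⋆^5 + T_CMB^5 as the default, with the exponents left as parameters so that the blackbody alternative (both exponents equal to 4) can be compared. Only temperature ratios are computed, so the dropped overall factor cancels. The function names and the numerical inputs in the usage lines are hypothetical.

```python
def dust_temperature_ratio(dilution, t_cmb, ref_dilution, ref_t_cmb,
                           t_star=6000.0, p_emit=6.0, p_abs=5.0):
    """Dust temperature relative to a reference galaxy.

    Assumes T**p_emit is proportional to dilution * t_star**p_abs + t_cmb**p_abs,
    where `dilution` stands for (R_star / R)**2. Temperatures are in kelvin.
    """
    num = dilution * t_star ** p_abs + t_cmb ** p_abs
    den = ref_dilution * t_star ** p_abs + ref_t_cmb ** p_abs
    return (num / den) ** (1.0 / p_emit)

def dilution_from_density(ref_dilution, rho_vir_ratio):
    """Rescale (R_star / R)**2 using R**3 ~ 1 / rho_vir, i.e. R**-2 ~ rho_vir**(2/3)."""
    return ref_dilution * rho_vir_ratio ** (2.0 / 3.0)

# Illustrative inputs only: a reference dilution and a halo eight times denser
ref = 1e-14
denser = dilution_from_density(ref, 8.0)
warmer = dust_temperature_ratio(denser, 2.7, ref, 2.7)
cmb_dominated = dust_temperature_ratio(ref, 300.0, ref, 2.7)
```

When the CMB term dominates, the dust temperature tracks the CMB temperature at the time of emission and becomes insensitive to the density of the host galaxy.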

To compute the entropy production rate, at the time $t = t_f + \Delta t$, per unit stellar
mass formed at the time $t_f$, we divide the luminosity by the temperature:

\begin{align}
\frac{d^2 S}{dm_\star\, dt}(t, t_f) &= \frac{1}{T(t_f, t_f + \Delta t)}\, \frac{dL}{dm_\star}(\Delta t) \tag{3.19} \\
&= \frac{1}{T(t_f, t)}\, \frac{L_\odot}{\langle m \rangle} \int_{0.08\, m_\odot}^{m_{\max}(t - t_f)} dm\; \xi_{\rm IMF}(m) \left( \frac{m}{m_\odot} \right)^{3.5} . \tag{3.20}
\end{align}

From this rate we can obtain the total entropy production rate at the time t, by
integrating over $t_f$ and using the SFR to account for the amount of stellar mass produced
per comoving volume at the time $t_f$:

$$ \frac{d^2 S}{dV_c\, dt}(t) = \int_0^t dt_f\; \frac{d^2 S}{dm_\star\, dt}(t, t_f)\; \dot\rho_\star(t_f) . \tag{3.21} $$

By Eq. (2.10), this becomes the integrand in Eq. (2.11) in the case where observers are
modeled by entropy production.^

^The powers of temperature in Eq. (3.16) arise because the dust neither absorbs nor emits
as a blackbody. However, our results would not change much if we adopted the blackbody ansatz
$T^4 = (R_\star / R)^2\, T_\star^4 + T_{\rm CMB}^4$. This is reassuring since the regime of validity of Eq. (3.16) depends on
material properties which are difficult to estimate.



4. Results 

This section contains the probability distributions we have computed over the
parameters Λ, Q, ΔN.

Ordering There are six subsections, corresponding to the six models described in
Sec. 2 (two different measures, and three different ways of modeling observers). Each
subsection contains three pages of plots. On the first page, Λ runs only over positive
values; on the second, Λ < 0. This division is useful since some of the interesting
features we find depend on the sign of the cosmological constant. Moreover, visually the
clearest way to display the probability distribution over Λ is as a function of log10 |Λ|,
which makes it difficult to include Λ = 0. On the third page of each subsection, we
display some distributions over all values of Λ.

^Numerically, the resulting double integral is most efficiently evaluated by exchanging the order of
integration:

\begin{align}
n_{\rm obs} &\propto \int_0^\infty dt\; V_c(t)\, \frac{d^2 S}{dV_c\, dt}(t) \tag{3.22} \\
&= \int_0^\infty dt \int_0^t dt_f\; V_c(t)\, \frac{d^2 S}{dm_\star\, dt}(t, t_f)\; \dot\rho_\star(t_f) \tag{3.23} \\
&= \int_0^\infty dt_f \int_{t_f}^\infty dt\; V_c(t)\, \frac{d^2 S}{dm_\star\, dt}(t, t_f)\; \dot\rho_\star(t_f) \tag{3.24} \\
&= \int_0^\infty dt_f \left[ \int_0^\infty d(\Delta t)\; V_c(t_f + \Delta t)\, \frac{d^2 S}{dm_\star\, dt}(t_f + \Delta t, t_f) \right] \dot\rho_\star(t_f) . \tag{3.25}
\end{align}

The inner integral represents the entropy that will eventually be produced inside the causal patch or
diamond by the stars created at time $t_f$. Because it does not depend on Q, it is more efficient to
compute this integral separately as a function of ($t_f$; Λ, ΔN), before multiplying by $\dot\rho_\star(t_f; \Lambda, Q, \Delta N)$
and computing the outer integral.
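
The exchange of the order of integration in Eqs. (3.22)-(3.25) can be illustrated on a discrete time grid. The sketch below is a toy: the three input functions are hypothetical stand-ins for the comoving volume, the entropy production per stellar mass, and the star formation rate, with only the last depending on Q. It shows that the Q-independent inner sum can be computed once and reused.

```python
import math

def volume(t):
    """Hypothetical comoving volume V_c(t)."""
    return math.exp(-t)

def entropy_kernel(t, t_f):
    """Hypothetical entropy production rate per stellar mass formed at t_f."""
    return 1.0 / (1.0 + (t - t_f))

def sfr(t_f, q):
    """Hypothetical star formation rate; the only Q-dependent factor."""
    return q * t_f * math.exp(-t_f)

def n_obs_direct(q, times, dt):
    """Outer sum over t, inner sum over formation times t_f <= t."""
    total = 0.0
    for i, t in enumerate(times):
        for t_f in times[: i + 1]:
            total += volume(t) * entropy_kernel(t, t_f) * sfr(t_f, q)
    return total * dt * dt

def inner_integrals(times, dt):
    """Q-independent inner sum over t >= t_f, one value per formation time."""
    return [sum(volume(t) * entropy_kernel(t, t_f) for t in times[j:]) * dt
            for j, t_f in enumerate(times)]

def n_obs_reordered(q, times, dt, inner):
    """Outer sum over t_f, reusing the precomputed inner sums."""
    return sum(w * sfr(t_f, q) for t_f, w in zip(times, inner)) * dt

dt = 0.05
times = [k * dt for k in range(400)]
inner = inner_integrals(times, dt)   # computed once for all values of Q
results = {q: n_obs_reordered(q, times, dt, inner) for q in (0.5, 1.0, 2.0)}
```

On a grid the two orderings contain exactly the same terms, so they agree up to floating-point rounding; the saving comes from not recomputing the inner sum for every value of Q.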

We have computed a full trivariate probability distribution for each case, which
cannot be displayed in a single plot. For this reason, the first page (Λ > 0) and second
page (Λ < 0) of every subsection contain 12 plots each. The first six plots (the top two
rows of plots) are single variable distributions over Λ, over Q, and over ΔN. In the first
three, the remaining two variables are held fixed. This corresponds to asking about
the probability distribution over a single parameter in the portion of the landscape
in which the two other parameters take the observed values. In the second group of
three plots, the remaining two variables are marginalized (i.e., integrated out). This
corresponds to asking about the probability distribution over a single parameter in the
entire three-parameter landscape we consider.

The remaining six plots on each page are bivariate probability distributions. Of 
these, the first three are distributions over two parameters with the third held fixed. 
This corresponds to asking about a bivariate probability distribution in the portion 
of the landscape in which the remaining parameter takes its observed value. In the 
other three bivariate plots, the remaining variable is integrated out. This corresponds 
to asking about the probability distribution over some pair of parameters in the entire 
three-parameter landscape we consider. 
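
The difference between holding parameters fixed and marginalizing over them can be made concrete on a small grid. The sketch below is illustrative only: it assumes the trivariate density is tabulated on a uniform grid in the display variables, so that sums approximate integrals, and the toy densities are hypothetical.

```python
def fix_two(grid, j_fixed, k_fixed):
    """Distribution over the first parameter with the other two held at fixed grid indices."""
    values = [plane[j_fixed][k_fixed] for plane in grid]
    total = sum(values)
    return [v / total for v in values]

def marginalize_two(grid):
    """Distribution over the first parameter with the other two summed out."""
    values = [sum(p for row in plane for p in row) for plane in grid]
    total = sum(values)
    return [v / total for v in values]

# A separable toy density: fixing and marginalizing give the same answer
separable = [[[(i + 1) * (j + 1) * (k + 1) for k in range(3)]
              for j in range(3)] for i in range(3)]

# A correlated toy density: the two procedures differ
correlated = [[[float((i + 1) ** (j + 1)) for k in range(3)]
               for j in range(3)] for i in range(3)]

fixed_sep = fix_two(separable, 0, 0)
marg_sep = marginalize_two(separable)
fixed_cor = fix_two(correlated, 0, 0)
marg_cor = marginalize_two(correlated)
```

The two kinds of plot coincide only when the parameters are uncorrelated, which is why both are shown.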

The third page of each subsection shows distributions in which Λ takes both positive
and negative values, either explicitly or by marginalization. The three plots in which
Λ is fixed to the observed value would be identical to the corresponding plots shown on
the Λ > 0 page. Moreover, we do not display any plots corresponding to a parameter
combination that led to a pathological probability distribution for either Λ > 0 or
Λ < 0, when the inclusion of both signs can only worsen the problem. (This case arises
for the causal patch only.)

Confidence regions In most plots (see the discussion below), we show the one-sigma
(68% confidence) and two-sigma (95% confidence) parameter regions. The one-sigma
region is unshaded, the two-sigma region is lightly shaded, and the remaining region is
shaded dark. In the one-parameter plots, confidence regions are centered on the median
of the probability distribution. In the two-parameter plots, they are centered on the
maximum probability density and bounded by contour lines of constant probability.
Additional contours are included for better visualization of the probability distribution.
They are not drawn at any special values of the probability density or of its integral.
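
The two kinds of confidence region described above can be sketched for a binned distribution. This is a minimal illustration, not the plotting code used for the figures: the one-parameter region is taken as the central interval around the median, and the two-parameter region as the set of highest-density cells containing the requested probability, assuming cells of equal size. All function names are hypothetical.

```python
def quantile(edges, probs, q):
    """Invert the piecewise-linear CDF defined by bin edges and bin probabilities."""
    target = q * sum(probs)
    cum = 0.0
    for i, p in enumerate(probs):
        if p > 0.0 and cum + p >= target:
            return edges[i] + (target - cum) / p * (edges[i + 1] - edges[i])
        cum += p
    return edges[-1]

def central_interval(edges, probs, level):
    """Interval centered on the median that contains a fraction `level` of the probability."""
    tail = (1.0 - level) / 2.0
    return quantile(edges, probs, tail), quantile(edges, probs, 1.0 - tail)

def highest_density_cells(probs, level):
    """Indices of the highest-density cells that together contain at least `level`."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    total = sum(probs)
    inside, cum = set(), 0.0
    for i in order:
        if cum >= level * total:
            break
        inside.add(i)
        cum += probs[i]
    return inside

# One-parameter example: a flat distribution on [0, 1] in ten bins
edges = [i / 10 for i in range(11)]
lo, hi = central_interval(edges, [1.0] * 10, 0.68)

# Two-parameter example: four flattened cells of equal area
region = highest_density_cells([0.5, 0.3, 0.15, 0.05], 0.68)
```

The boundary of the highest-density set is a contour of constant probability density, which is the sense in which the two-parameter regions are bounded by contour lines.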

The displayed confidence regions are strictly based on the probability distribution
over the portion of the landscape for which we have computed probabilities:

$$ 10^{-3}\, \Lambda_0 < |\Lambda| < 10^{5}\, \Lambda_0 \tag{4.1} $$
$$ 10^{-1}\, Q_0 < Q < 10^{2}\, Q_0 \tag{4.2} $$
$$ -3.5 < \Delta N < \infty , \tag{4.3} $$

with all other physical parameters held fixed. In other words, we are setting the
probability to zero outside the above range, and for universes in which other parameters differ
from ours. As we noted in the introduction, this is legitimate: we are asking whether
or not our observations are typical among those made by observers in this portion of
the landscape, described by a particular range of three particular parameters. If they
are highly atypical, then there is a problem.

In certain plots involving Λ < 0 with the causal patch measure, the probability
density increases very rapidly towards the boundary of our parameter range. Because
of this runaway behavior, the 1σ and 2σ regions would depend sensitively on the precise
value of the parameter boundary. In these cases, we do not display confidence intervals
in single-variable plots; in bivariate plots, we display only the contours of constant
probability density, along with an arrow indicating the direction of increasing probability
density. Other runaways are less strong; in this case we do display confidence intervals
based on the above parameter range. Finally, not every probability distribution that
increases monotonically towards a boundary is indicative of a runaway, because it might
be cut off by a change of regime at finite distance beyond the boundary: For any value
of Q and Λ, sufficiently large curvature will disrupt structure formation. And for any
ΔN and Λ, sufficiently large Q (of order the upper bound we consider, 10^2 Q_0) leads
to a change of regime. We will note in the captions which plots have a true runaway
direction.

Display range and data point We display the entire range of parameters for which
we have computed the probability density, Eq. (4.3), except, of course, for ΔN, where
we cut off the display at ΔN = 1. For larger values of ΔN, curvature is too weak to
affect either the dynamics of structure formation or the geometry of the causal patch
or diamond. In this regime, the probability distribution over ΔN is proportional to
the prior distribution, Eq. (2.7). All contour intervals take this undisplayed portion
of the probability distribution into account. Also, when we marginalize over ΔN, the
undisplayed portion is included in the range of the integral.
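
The weight of the undisplayed region can be estimated analytically if the prior over ΔN falls off as (60 + ΔN)^(-4), the factor appearing in Eq. (2.13). The sketch below rests on that assumption and considers the prior alone, so it is not the fraction of the full probability distribution, which also includes the number of observations. The function name is hypothetical.

```python
def prior_tail_fraction(dn_cut, dn_min=-3.5, offset=60.0):
    """Fraction of the assumed prior weight (offset + dN)**-4 lying above dn_cut.

    The prior is taken over dn_min < dN < infinity. Since the integral of
    (offset + x)**-4 from a to infinity equals (offset + a)**-3 / 3, the
    fraction is a simple ratio of cubes.
    """
    return ((offset + dn_min) / (offset + dn_cut)) ** 3

undisplayed = prior_tail_fraction(1.0)   # prior weight beyond the display cut-off
```

Under this assumption most of the prior weight lies beyond the display cut-off, which is why the undisplayed portion must be included in the confidence regions and in the marginalization.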

The display variables are not Λ, Q, and ΔN, but log10(|Λ|/Λ_0), log10(Q/Q_0), and
ΔN. Therefore, the observed values correspond to 0 on every axis. To guide the eye,
the vertical axis intersects the horizontal axis at 0 in all single-parameter plots, so the
observed value is where the vertical axis is. In the two-parameter plots, the data point
(0, 0) is shown by a green triangle.

There are two subtleties: First, in the figures that display only the negative range
of Λ, the observed (positive) value cannot be shown. We find it useful to show as
"data" our "evil twin" universe, with Λ = -Λ_0, Q = Q_0, and ΔN > 0, in these plots.
Secondly, spatial curvature has not been detected, only constrained. Thus, ΔN = 0
corresponds to a lower bound, and not to the actual value of ΔN in our universe.
The reader should keep in mind, therefore, that in single-variable plots over ΔN the
entire region to the right of the vertical axis is compatible with observation. In
two-parameter plots involving ΔN, the observed universe lies somewhere on a semi-infinite
line starting at the triangle and running upward towards larger ΔN. As long as some
part of the ΔN > 0 range is not very improbable, there would be no conflict with
experiment, even if the point ΔN = 0 were highly improbable.

Comparing the probability distributions to observation Because life is short, 
we reject scientific theories with finite confidence only, taking a gamble that freak 
chance might be leading us astray. Often, we can increase our confidence by repeating 
experiments. In cosmology, we sometimes cannot. This limitation has nothing to do 
with the multiverse, but stems from the finiteness of the observable universe. (Because
Λ_0 ≠ 0, the observable universe will not grow indefinitely, so this is not merely an
accident of our present era.) For example, cosmic variance does not prevent us from
drawing conclusions from the CMB, but it does prevent us from sharpening them when 
they do not meet a confidence standard we are comfortable with, as may be the case 
for the low quadrupole. 

There is one data point for each parameter considered in this paper, the value
observed in our universe (or, in the case of ΔN, the range not ruled out). If this data
point happened to have very small probability (e.g., if it lay well beyond 6σ, if this were
our desired level of confidence), then our observations would be extremely unexpected
given the theory from which the probability distribution was computed. In other words,
the theory would conflict with experiment at that level of confidence. Since the theory
consists of a combination of a prior distribution in the landscape (including the possible 
range of parameters), a choice of measure, and a choice of observer model, at least one 
of these elements is excluded. 

Main conclusions Our conclusions are more fully described in the introduction. The
causal diamond measure is remarkably successful. The observed values of parameters
lie in the 2σ confidence region of all but one or two out of thirty-three probability
distributions we show for each observer model (where they lie within 3σ).

The causal patch is problematic, independently of the observer model. In the
absence of curvature, results are similar to the causal diamond, if not quite as successful
quantitatively. In the presence of curvature, we find significant conflicts between
prediction and observation. They are sharpest in distributions that include negative values
of Λ (see, e.g., Fig. 16), where we find a strong runaway towards small |Λ| and strong
pressure towards the large-curvature boundary on structure formation. If we restrict
to positive Λ, there is still a weak runaway to small positive values of Λ, though this
is barely visible in the parameter range we display. As discussed in the introduction,
these runaways imply that the causal patch measure is incorrect, or that the prior
probabilities differ significantly from those we have assumed, or that the finite size of
the landscape effectively provides a cut-off on how small Λ can be, and that we find
ourselves near this cut-off. The level of confidence at which the measure is excluded by
the data depends sensitively on this cut-off; with no cut-off, it is infinite.

4.1 Weighting by entropy production in the causal diamond 

With this observer model and measure, the observed parameters fall within the central
2σ of almost all probability distributions we compute. The only exceptions are the
bivariate distributions over Q and the full range of Λ (both positive and negative Λ),
where we find ourselves outside of 2σ but within 3σ (6th and 9th plot, Fig. 4). The
total probability for Λ < 0 is 25 times larger than that for Λ > 0 when Q = Q_0 and
ΔN > 0 are held fixed, and 11 times larger when Q and ΔN are marginalized (1st and
2nd plot, Fig. 4).

As explained in Sec. 5.4, the distribution over Q, with Λ = Λ_0 and ΔN > 0, has
a maximum centered on the observed value (2nd plot, Fig. 2). This feature is not
reproduced with any other choice of measure or observer model. However, this does
not in itself lend particular weight to this measure and observer model. Any model in
which the observed value falls, say, within 2σ should be regarded as unproblematic.

As explained in Sec. 5.5, the most likely value of Q grows with Λ (9th and 12th
plot, Fig. 2); therefore, the observed value of Q is no longer the most likely after Λ is
integrated out (5th plot, Fig. 2), though it is well within 2σ.

For Λ = Λ_0, the 3rd plot in Fig. 2 shows that too much curvature suppresses
structure formation: the probability distribution increases towards smaller ΔN (governed
mainly by the prior distribution), but turns around near ΔN = -2. For negative
cosmological constant (Fig. 3), the analogous plot does not show this feature because
the turnaround occurs just outside our display range. The reason for this difference is
discussed in Sec. 5.3.

[Figure 2 plots. Panel titles: "Q = Q_0 and ΔN > 0 Fixed"; "Λ = Λ_0 and ΔN > 0 Fixed"; "Q and ΔN Marginalized"; "Λ and ΔN Marginalized"; "Λ and Q Marginalized". Axes: log10(Q/Q_0), log10(Λ/Λ_0).]

Figure 2: Λ > 0, Causal Diamond, Entropy Production. This is the only model studied
which produces a peak in the Q distribution of the 2nd plot.

[Figure 3 plots. Panel titles include: "|Λ| = Λ_0 and Q = Q_0"; "Q = Q_0 and ΔN > 0 Fixed"; "|Λ| = Λ_0 and ΔN > 0 Fixed". Axes: log10(Q/Q_0), log10(|Λ|/Λ_0).]

Figure 3: Λ < 0, Causal Diamond, Entropy Production. Unlike for Λ > 0, the probability
density increases monotonically with curvature (3rd and 6th plot). However, the range
allowed by observation (ΔN > 0) overlaps with the central 1σ region (unshaded). This good
agreement is not an artifact of our lower cut-off, ΔN > -3.5, because the probability is about
to be suppressed by the disruption of structure formation for slightly smaller ΔN. This can
be seen by studying the distribution over ΔN at Q = 10^{-0.5} Q_0 in the 7th and 10th plots.

Figure 4: All values of Λ, Causal Diamond, Entropy Production. Λ < 0 is preferred over
Λ > 0 by a factor of 25 (11) in the 1st (2nd) plot. The observed values of (Λ, Q) lie outside
2σ, but within 3σ, in the 6th and 9th plot.



4.2 Weighting by star formation + 5 Gyr in the causal diamond 

For this choice of measure and observer model, we find that the observed values of
parameters lie within or at the 2σ contours of all plots with a single exception: in
the bivariate distribution over Q and the full range of Λ (both positive and negative
Λ) with ΔN marginalized (9th plot, Fig. 7), we find ourselves within 3σ. The total
probability for Λ < 0 is 13 times larger than that for Λ > 0 when Q and ΔN are held
fixed, and 12.6 times larger when Q and ΔN are marginalized.

Unlike in the entropy production model shown previously, the distribution over Q grows
monotonically towards large Q, but not rapidly enough to render the observed value
unlikely. The preferred positive value of Λ is virtually independent of Q, as explained
in Sec. 5.5, and is roughly equal to the observed value, as seen in the 9th and 12th plot
in Fig. 5.

Figure 5: Λ > 0, Causal Diamond, 5 Gyr delay time. The preferred value of Λ is independent
of Q, as seen in the 9th and 12th plots.

Figure 6: Λ < 0, Causal Diamond, 5 Gyr delay time. Note the peculiar feature around
Q = Q_0 in the 2nd plot. (It can also be seen in the 7th plot.) The increasing probability
towards small ΔN in the 3rd and 6th plot is not a runaway; see the caption of Fig. 3.

Figure 7: All values of Λ, Causal Diamond, 5 Gyr delay time. As with entropy production,
Λ < 0 is preferred over Λ > 0 (here, by a factor of 13). The observed values of (Λ, Q) lie
outside 2σ, but within 3σ, in the 6th and 9th plot.






4.3 Weighting by star formation + 10 Gyr in the causal diamond 

The 10 Gyr delay model shares many features with the 5 Gyr delay model, with only
small numerical differences. The preferred value of Λ is again nearly independent of Q
but slightly smaller than with the 5 Gyr delay. This feature is explained in Sec. 5.5.

The observed values of parameters lie within or at the 2σ contours of almost all
plots. They are within 3σ of some of the distributions that range over both positive
and negative values of Λ. The total probability for negative Λ is 12.4 times larger than
that for positive Λ when Q and ΔN > 0 are fixed, and 12.8 times larger when Q and
ΔN are marginalized.

Figure 8: Λ > 0, Causal Diamond, 10 Gyr delay time. As with the 5 Gyr delay, we find a
monotonic distribution over Q (2nd and 5th plot). The preferred value of Λ is independent
of Q, and is somewhat smaller than with the 5 Gyr delay.

Figure 9: Λ < 0, Causal Diamond, 10 Gyr delay time. The 3rd and 6th plots show an
increase in probability toward small ΔN. This is not a runaway; see the caption of Fig. 3.

Figure 10: All values of Λ, Causal Diamond, 10 Gyr delay time. As usual, Λ < 0 is more
likely than Λ > 0. In the 9th and 12th plot, the observed values of (Λ, Q) lie outside 2σ but
within 3σ.

4.4 Weighting by entropy production in the causal patch

We now turn to the causal patch measure. Independently of the observer model, for
$\Lambda < 0$, the rapid divergence as $|\Lambda| \to 0$ (Sec. 5.3) prevents us from displaying
confidence regions in the affected plots in Figs. 12, 15, and 18. For $\Lambda > 0$, the
probability density over $\log_{10}(|\Lambda|/\Lambda_0)$ goes to a constant for
$\Lambda \ll t_c^{-2}$. This runaway is mild enough that we display confidence regions
(based, as always, on an assumed cut-off at the end of our parameter range) in the relevant
plots in Figs. 11, 14, and 17. The runaway is not always readily apparent, but we have
indicated all runaway directions with arrows. For $\Lambda < 0$, the growth towards small
$\Delta N$ (also described in Sec. 5.3) is so rapid at $\Delta N = -3.5$ (the lower end of our
parameter space) that confidence regions are very sensitive to the exact position of this
cut-off and are not displayed.

In this subsection, we begin with the case where observers are modelled by entropy
production. For $\Lambda > 0$, we find that the observed values are at or within $2\sigma$,
except in those distributions which are susceptible to the above runaway towards small
$\Lambda$. For $\Lambda < 0$, the only acceptable results obtain when curvature is fixed and
negligible. Even in the absence of curvature, with $\Delta N$ and $Q$ fixed, negative values
of the cosmological constant are more probable than positive values, by a factor of order 10.
(As explained in Sec. 5.1, the distribution in $\Lambda$ is wider than in the causal diamond
case and so has significant support at the boundary of our parameter space. Thus, computing
a more precise value of the relative probability for different signs of $\Lambda$, from our
data alone, would not be very informative.)

Unlike the case of the causal diamond with the same observer model, the distribution
in $Q$ is monotonically increasing for $\Lambda = \Lambda_0$ and $\Delta N > 0$. This
difference is explained in Sec. 5.4. As in the case of the causal diamond, the preferred
value of $\Lambda$ grows with $Q$ (see Sec. 5.5).

Figure 11: $\Lambda > 0$, Causal Patch, Entropy Production. The probability density goes to
a constant for $\Lambda \ll t_c^{-2}$. This runaway is indicated by arrows. With
$\Delta N > 0$, curvature dominates so late that the runaway is not evident even at the
smallest values of $\Lambda$ in the first plot. In the 8th and 11th plot, however, the
distribution can be seen to flatten out towards small $\Lambda$ at $\Delta N = -3$.

Figure 12: $\Lambda < 0$, Causal Patch, Entropy Production. The probability density grows
like $|\Lambda|^{-1}$ for $\Lambda \ll t_c^{-2}$. This runaway is indicated by arrows. At
fixed $\Lambda$, small $\Delta N$ is strongly preferred (3rd and 7th plots).

Figure 13: All values of $\Lambda$, Causal Patch, Entropy Production. In the presence of
curvature, $\Lambda < 0$ is preferred over $\Lambda > 0$ by an arbitrarily large amount,
depending on the cut-off in $\Lambda$. In the 5th plot, the observed value of $(\Lambda, Q)$
lies outside $2\sigma$, but within $3\sigma$.

4.5 Weighting by star formation + 5 Gyr in the causal patch

This case is very similar to the previous subsection, and we refer the reader to its text
and captions.

Figure 14: $\Lambda > 0$, Causal Patch, 5 Gyr delay time. In the absence of curvature, the
preferred value of $\Lambda$ is independent of $Q$, as seen in the 9th plot. For other
comments, see the caption of Fig. 11.

Figure 15: $\Lambda < 0$, Causal Patch, 5 Gyr delay time. In the absence of curvature, the
preferred value of $\Lambda$ is independent of $Q$, as seen in the 9th plot. For other
comments, see the caption of Fig. 12.

Figure 16: All values of $\Lambda$, Causal Patch, 5 Gyr delay time. Negative $\Lambda$ is
preferred here by a factor of order 10 when curvature is absent. In the 5th plot, the data
point is within $3\sigma$. For other comments, see the caption of Fig. 13.

4.6 Weighting by star formation + 10 Gyr in the causal patch

This case is very similar to the previous subsection, and we refer the reader to its text
and captions.

Figure 17: $\Lambda > 0$, Causal Patch, 10 Gyr delay time. See caption of Fig. 14.

Figure 18: $\Lambda < 0$, Causal Patch, 10 Gyr delay time. See caption of Fig. 15.










5. Discussion 



The main purpose of this section is to provide an intuitive qualitative understanding of
the most important features in the plots of Sec. 4. We will also supply some additional
quantitative results that are not immediately apparent in the plots. (In particular, we
will compute the probability that nonvanishing spatial curvature will be detected in
the near future, at the end of subsection 5.2.)

What are the physical consequences of varying the parameters $(\Lambda, Q, \Delta N)$?
Varying $\Lambda$ or $\Delta N$ changes both the dynamical evolution and the geometry of the
cut-off region. Dynamically, these parameters affect the rates of star formation and entropy
production. Geometrically, they affect the amount of comoving volume contained
within the causal patch or the causal diamond. In several probability distributions,
the geometric effect is quantitatively more important than the dynamical effect.

The parameter $Q$, on the other hand, enters the probability distribution only
dynamically, through its effects on the rate at which observers form. In an approximately
homogeneous universe, the initial strength of density perturbations does not have an
important effect on the comoving volume within the patch or diamond. (Ref. [32]
discusses possible effects of the breakdown of the homogeneous approximation, which are
not modeled here.)

5.1 Varying the cosmological constant only 

Let us begin by discussing the probability distribution over the cosmological constant
alone, with both remaining parameters fixed to their observed values. Its effects on
star formation and on the geometry of the causal patch or diamond depend strongly
on the sign of the cosmological constant. We begin by discussing the case $\Lambda > 0$; our
treatment follows Ref. [23].

With $\Lambda > 0$, structure formation, and thus star formation, halts after a time of
order

\[
t_\Lambda \equiv (3/\Lambda)^{1/2} .
\tag{5.1}
\]

However, this effect becomes important only for relatively large values of $\Lambda$, of
order $100\,\Lambda_0$. As we shall now see, these values are already suppressed by geometric
effects. We will self-consistently treat the rate of observation, $\dot n_{\rm obs}(t)$, as
fixed. (We emphasize that this approximation, like all others in this section, is made for
the sole purpose of elucidating our plots, which are always obtained using a full numerical
calculation of both the geometry and the observation rate.)

For $\Lambda > 0$, a period of matter domination is succeeded, around the time $t_\Lambda$,
by an infinite period of vacuum domination. In the sudden transition approximation, the
scale factor is given by

\[
a(t) \propto \begin{cases}
t^{2/3} & \text{if } t < \tfrac{2}{3} t_\Lambda , \\
\left(\tfrac{2}{3} t_\Lambda\right)^{2/3} \exp\left(t/t_\Lambda - 2/3\right) & \text{if } t \geq \tfrac{2}{3} t_\Lambda .
\end{cases}
\tag{5.2}
\]

Matching both the scale factor and its derivative requires cumbersome shift terms,
or order-one corrections to the matching time, like the 2/3 appearing in the above
equation. Our goal is only to understand the rough features of our plots, not to keep
track of all factors of order one. For this purpose, it suffices to match the value of the
scale factor but not its derivative at the transition time. We will do so in all formulas
below. For the present case, the simplified version of the scale factor is

\[
a(t) \propto \begin{cases}
t^{2/3} & \text{if } t < t_\Lambda , \\
t_\Lambda^{2/3} \exp\left(t/t_\Lambda - 1\right) & \text{if } t \geq t_\Lambda .
\end{cases}
\tag{5.3}
\]
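By Eq. (2.2), the comoving radius of the causal patch is given by

\[
\chi_{\rm patch}(t) \propto \begin{cases}
4 t_\Lambda^{1/3} - 3 t^{1/3} & \text{if } t < t_\Lambda , \\
t_\Lambda^{1/3} \exp\left(-t/t_\Lambda + 1\right) & \text{if } t \geq t_\Lambda .
\end{cases}
\tag{5.4}
\]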

By Eq. (2.3), the comoving radius of the causal diamond is given by

\[
\chi_{\rm dia}(t) \propto \begin{cases}
3 t^{1/3} & \text{if } t \ll t_\Lambda , \\
t_\Lambda^{1/3} \exp\left(-t/t_\Lambda + 1\right) & \text{if } t \gtrsim t_\Lambda .
\end{cases}
\tag{5.5}
\]

The "edge" of the causal diamond, where the future light-cone from the reheating
surface intersects the boundary of the causal patch, occurs at the time $0.23\,t_\Lambda$ [23].
Since this is approximately the same time at which the scale factor changes from power
law to exponential growth, there is no need for another case distinction in Eq. (5.5)
at this level of approximation.
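
As a quick sanity check of Eq. (5.4), the following sketch integrates $dt'/a(t')$ numerically for the simplified scale factor of Eq. (5.3) and compares the result with the closed form. The function names, the choice of units ($t_\Lambda = 1$), the late-time cutoff, and the midpoint integrator are illustrative assumptions and are not part of the numerical calculation behind the plots.

```python
import math

T_LAMBDA = 1.0  # assumption: times are measured in units of t_Lambda


def scale_factor(t, t_lam=T_LAMBDA):
    """Simplified sudden-transition scale factor of Eq. (5.3)."""
    if t < t_lam:
        return t ** (2.0 / 3.0)
    return t_lam ** (2.0 / 3.0) * math.exp(t / t_lam - 1.0)


def chi_patch_closed(t, t_lam=T_LAMBDA):
    """Closed-form comoving radius of the causal patch, Eq. (5.4)."""
    if t < t_lam:
        return 4.0 * t_lam ** (1.0 / 3.0) - 3.0 * t ** (1.0 / 3.0)
    return t_lam ** (1.0 / 3.0) * math.exp(-t / t_lam + 1.0)


def chi_patch_numeric(t, t_lam=T_LAMBDA, t_max_factor=40.0, steps=200_000):
    """Midpoint-rule estimate of the integral of dt'/a(t') from t to a late cutoff."""
    t_max = t_max_factor * t_lam  # integrand is ~exp(-39) here, so the tail is negligible
    h = (t_max - t) / steps
    return h * sum(1.0 / scale_factor(t + (i + 0.5) * h, t_lam) for i in range(steps))


if __name__ == "__main__":
    for t in (0.3, 2.0):
        print(t, chi_patch_numeric(t), chi_patch_closed(t))
```

The two columns should agree closely both before and after the transition at $t = t_\Lambda$, which is all the closed form is meant to capture.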

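Since in this section we are assuming negligible spatial curvature, the comoving
volume for the patch is

\[
V_c^{\rm patch}(t) \propto \begin{cases}
\frac{4\pi}{3} \left(4 t_\Lambda^{1/3} - 3 t^{1/3}\right)^3 & \text{if } t < t_\Lambda , \\
\frac{4\pi}{3}\, t_\Lambda \exp\left(-3t/t_\Lambda + 3\right) & \text{if } t \geq t_\Lambda ,
\end{cases}
\tag{5.6}
\]

while that for the diamond is

\[
V_c^{\rm dia}(t) \propto \begin{cases}
36\pi\, t & \text{if } t \ll t_\Lambda , \\
\frac{4\pi}{3}\, t_\Lambda \exp\left(-3t/t_\Lambda + 3\right) & \text{if } t \gtrsim t_\Lambda .
\end{cases}
\tag{5.7}
\]

Now we are in a position to derive a probability distribution for $\Lambda$ by counting
observers:

\[
\frac{dp}{d\log_{10}\Lambda} \propto \frac{d\tilde p}{d\log_{10}\Lambda}
\times \int_0^\infty dt\, \dot n_{\rm obs}(t)\, V_c(t) .
\tag{5.8}
\]

Recall from section 2.2 that we assume $d\tilde p/d\log_{10}\Lambda \propto \Lambda$, and that
here we are assuming $\dot n_{\rm obs}(t)$ is independent of $\Lambda$.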

Beginning with the causal diamond, we see that

\[
\frac{dp}{d\log_{10}\Lambda} \propto \Lambda \left[
36\pi \int_0^{t_\Lambda} dt\, \dot n_{\rm obs}(t)\, t
+ \frac{4\pi}{3}\, t_\Lambda \int_{t_\Lambda}^{\infty} dt\, \dot n_{\rm obs}(t) \exp\left(-3t/t_\Lambda + 3\right)
\right] .
\tag{5.9}
\]

If $\dot n_{\rm obs}(t)$ were constant in time, the bracketed terms would be proportional to
$t_\Lambda^2 \propto \Lambda^{-1}$ and we would obtain a distribution flat in
$\log_{10}\Lambda$. In reality, $\dot n_{\rm obs}(t)$ has a peak at a time $t_{\rm peak}$ of
order Gyr with a width that is also of order Gyr. This will serve to select a preferred value
of $\Lambda$ as follows. If $t_\Lambda \ll t_{\rm peak}$, then only the second integral
in Eq. (5.9) contributes, and this integral features an exponential suppression due to
the rapid emptying of the de Sitter horizon during $\Lambda$-domination. If
$t_\Lambda \gg t_{\rm peak}$, then only the first integral contributes, and its contribution
will be independent of $\Lambda$. But there are more vacua with larger $\Lambda$ than smaller
$\Lambda$; this is encoded in the overall factor of $\Lambda$ coming from the prior, which
tends to push the probability toward larger values of $\Lambda$. Thus we conclude that the
most favorable value of $\Lambda$ is one where $t_{\rm peak} \approx t_\Lambda$ (more precisely
we conclude that $t_{\rm peak} \approx t_{\rm edge}$, but these are the same up to order-one
factors). Similarly, the width of our distribution depends on the width of
$\dot n_{\rm obs}(t)$: if $t_{\rm on}$ and $t_{\rm off}$ are the times when
$\dot n_{\rm obs}(t)$ is at half-maximum, then the corresponding values of $\Lambda$ will give
the approximate $1\sigma$ boundaries in the $\log_{10}\Lambda$ distribution [23].
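
The selection of a preferred $\Lambda$ by Eq. (5.9) can be illustrated with a toy observation rate. The sketch below is only an illustration under stated assumptions: the Gaussian form of $\dot n_{\rm obs}(t)$, its peak time and width, the integration cutoff, and all function names are hypothetical choices, not the star formation model used for the plots.

```python
import math


def n_obs(t, t_peak=1.0, width=0.3):
    """Toy observation rate: a Gaussian bump (an assumption, not the paper's model)."""
    return math.exp(-0.5 * ((t - t_peak) / width) ** 2)


def diamond_weight(t_lam, steps=4000, t_max=5.0):
    """Right-hand side of Eq. (5.9), up to normalization, with Lambda = 3 / t_lam**2."""
    lam = 3.0 / t_lam ** 2
    h = t_max / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        if t < t_lam:
            volume = 36.0 * math.pi * t  # matter-dominated branch of Eq. (5.7)
        else:
            volume = (4.0 * math.pi / 3.0) * t_lam * math.exp(-3.0 * t / t_lam + 3.0)
        total += n_obs(t) * volume * h
    return lam * total


def most_likely_t_lambda(grid_points=200):
    """Scan log-spaced t_Lambda values and return the one maximizing the weight."""
    grid = [10 ** (-1.5 + 3.0 * k / (grid_points - 1)) for k in range(grid_points)]
    return max(grid, key=diamond_weight)


if __name__ == "__main__":
    print("preferred t_Lambda / t_peak:", most_likely_t_lambda())
```

With these toy inputs the maximum lands at $t_\Lambda$ of order $t_{\rm peak}$: much smaller $t_\Lambda$ is exponentially suppressed, while much larger $t_\Lambda$ is penalized by the prior factor of $\Lambda$.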

The same analysis holds for the causal patch, with one modification. In Eq. (5.6),
we see that for $t < t_\Lambda$ the patch volume has some residual $\Lambda$ dependence. So
when $t_{\rm peak} \ll t_\Lambda$, the factor of $\Lambda$ from the prior is partially
cancelled by the factor of $t_\Lambda \propto \Lambda^{-1/2}$ in the comoving volume. The
result is that the probability distributions using the patch are more tolerant of small
values of $\Lambda$ than those using the diamond.

These estimates are confirmed by our plots for the probability distribution over
$\Lambda > 0$, with $Q = Q_0$ and $\Delta N > 0$ fixed (the first plot in each figure). Fig. 2
shows the result for entropy production, where the most likely value of $\Lambda$ is about
$10\,\Lambda_0$. This corresponds to $t_{\rm peak} \approx 2$ Gyr, so we expect that in the
5 Gyr delay time model of Fig. 5 the most likely value of $\Lambda$ would be smaller by a
factor of $(2/7)^2 \approx 0.08$, and indeed we can see in that plot that the most likely
value is now slightly smaller than $\Lambda_0$. With a 10 Gyr time delay, our estimate says
that the most likely value of $\Lambda$ should be $(2/12)^2 \approx 0.03$ times that for
entropy production, and in Fig. 8 we see that the most likely value is down to nearly
$\Lambda_0/10$. Also, for the patch, in Figs. 11, 14, and 17 we see a broadening of the
distributions compared to those of the diamond.
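
The scaling used in these estimates is simply $\Lambda_{\rm peak} \propto t_{\rm peak}^{-2}$, which follows from $t_{\rm peak} \approx t_\Lambda$ and Eq. (5.1). A minimal sketch of the arithmetic (the helper name is hypothetical, and the 2 Gyr reference time is the rough value quoted in the text):

```python
def lambda_ratio(t_peak_reference, t_peak_new):
    """Ratio of preferred Lambda values, assuming Lambda_peak scales as t_peak**-2."""
    return (t_peak_reference / t_peak_new) ** 2


# Entropy production peaks near 2 Gyr; the delay models shift this by 5 or 10 Gyr.
ratio_5gyr = lambda_ratio(2.0, 2.0 + 5.0)
ratio_10gyr = lambda_ratio(2.0, 2.0 + 10.0)

print(f"5 Gyr delay:  {ratio_5gyr:.3f}")   # prints 0.082
print(f"10 Gyr delay: {ratio_10gyr:.3f}")  # prints 0.028
```

Multiplying the entropy-production value of about $10\,\Lambda_0$ by these ratios gives roughly $0.8\,\Lambda_0$ and $0.3\,\Lambda_0$, the same order of magnitude as the values read off from the plots.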

The $\Lambda < 0$ case has several important differences from the $\Lambda > 0$ case.
Instead of halting structure formation, a collapsing universe leads to a period of enhanced
structure growth. However, the structures which grow during the collapse phase are
much larger than the structures which grow in the early universe. The largest of these
cannot cool and do not form stars. As a result, enhanced structure growth is actually
a subdominant effect in explaining the difference between the probability distributions.
Far more important is the difference in geometry for a $\Lambda < 0$ universe, which we will
now discuss.

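The scale factor for $\Lambda < 0$ (and negligible curvature) is

\[
a(t) = \left( \tfrac{2}{3}\, t_\Lambda \sin\left( \frac{3t}{2 t_\Lambda} \right) \right)^{2/3} .
\tag{5.10}
\]
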
The comoving volume can be expressed in terms of hypergeometric functions. Once 
again the behavior of the causal diamond is easier to estimate. A reasonable approxi- 
mation is 



For $\Lambda < 0$, the "edge" time for the causal diamond is
$t_{\rm edge} = \pi t_\Lambda/3$ (coinciding with the turnaround time for the scale factor).
Since $\pi/3 > 0.23$, the probability distribution for $\Lambda < 0$ peaks at a value of
$|\Lambda|$ which is higher by a factor of $(\pi/(3 \times 0.23))^2 \approx 21$ than that
for $\Lambda > 0$ with the same observer model. This is evident when comparing, for example,
the first plot of Fig. 2 with that of Fig. 3. It is also manifest in the distributions over
both positive and negative values of $\Lambda$, such as the first plot in Fig. 4.
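
Both numbers in this paragraph can be checked in a few lines. The sketch below assumes the standard flat, matter-plus-negative-$\Lambda$ scale factor $a(t) \propto \sin^{2/3}(3t/2t_\Lambda)$; the function names and the grid search are illustrative choices only.

```python
import math


def scale_factor_negative_lambda(t, t_lam=1.0):
    """Assumed flat, matter-plus-negative-Lambda scale factor (up to normalization)."""
    return math.sin(1.5 * t / t_lam) ** (2.0 / 3.0)


def turnaround_time(t_lam=1.0, samples=200_001):
    """Grid search for the time of maximum expansion between bang and crunch."""
    t_crunch = 2.0 * math.pi * t_lam / 3.0
    grid = [t_crunch * k / (samples - 1) for k in range(1, samples - 1)]
    return max(grid, key=lambda t: scale_factor_negative_lambda(t, t_lam))


# t_edge / t_Lambda is pi/3 for Lambda < 0 and about 0.23 for Lambda > 0.
edge_ratio = (math.pi / 3.0) / 0.23
# At fixed t_edge (set by t_peak), |Lambda| scales as t_Lambda**-2.
peak_shift = edge_ratio ** 2

if __name__ == "__main__":
    print("turnaround / t_Lambda:", turnaround_time(), "vs pi/3 =", math.pi / 3.0)
    print("shift in preferred |Lambda|:", peak_shift)
```

The turnaround comes out at $\pi t_\Lambda/3$ and the shift at about 20.7, consistent with the factor of 21 quoted above.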

According to our approximate equation, the diamond's volume at $t = t_{\rm edge}$ should
be larger for $\Lambda < 0$, by the same factor of 21, for equal $|\Lambda|$. Indeed, the
first plot of Fig. 4 shows that the height of the peak for $\Lambda < 0$ is larger than for
$\Lambda > 0$ by about this amount. We can also compare the integrated probabilities for each
sign of $\Lambda$, $p(\Lambda < 0)/p(\Lambda > 0)$. For the entropy production model, this
ratio is 25, which we can attribute to the enhancement of diamond volume for those universes
with $\Lambda < 0$ where $t_{\rm edge}$ coincides with the peak of entropy production. In the
5 Gyr and 10 Gyr time delay models the ratio is 13 and 12.4, respectively. The ratio is
smaller than in the entropy model, because about half of the range of $\log_{10}|\Lambda|$
has zero probability in the time delay models: those universes crunch before
$t = t_{\rm delay}$.

Similar conclusions hold for the causal patch when $\Lambda < 0$. As for $\Lambda > 0$,
small values of $|\Lambda|$ are less suppressed by the causal patch than by the diamond. This
broadens the probability distribution so much that our range of $\log_{10}|\Lambda|$ misses
some of the probability for small $|\Lambda|$, as one can see in the first plot of Figs. 12,
15, and 18. This means we cannot accurately compute certain quantities, such as the ratio
$p(\Lambda < 0)/p(\Lambda > 0)$, for the patch. We can make some qualitative statements,
though. For the entropy production model, we expect that the ratio should be of the same
order for the patch as for the diamond. For the time delay models, the ratio should decrease
for the same reason that it decreased in the diamond: the universe crunches too early
in part of the parameter space. However, the decrease should be smaller in the patch
than it was in the diamond. That is because the universes which are crunching "too
early" are the ones with large $|\Lambda|$, but as discussed above this is precisely the
region of parameter space which is relatively de-emphasized in the patch as compared with
the diamond.

5.2 Varying spatial curvature only 

Now we will consider the implications of varying spatial curvature, while keeping
$\Lambda$ (and $Q$) fixed at its observed value (or at minus this value; see below). How are
structure formation and observer formation affected by a period of curvature domination
following matter domination, beginning at $t \sim t_c$? In the sudden transition
approximation, the scale factor is

\[
a(t) = \begin{cases}
t^{2/3}\, t_c^{1/3} & \text{if } t < t_c , \\
t & \text{if } t_c \leq t < t_\Lambda , \\
t_\Lambda \exp\left(t/t_\Lambda - 1\right) & \text{if } t_\Lambda \leq t .
\end{cases}
\tag{5.12}
\]
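By Eq. (2.2), the comoving radius of the causal patch is given by

\[
\chi_{\rm patch}(t) \propto \begin{cases}
4 + \ln\frac{t_\Lambda}{t_c} - 3 \left(\frac{t}{t_c}\right)^{1/3} & \text{if } t < t_c , \\
1 + \ln\frac{t_\Lambda}{t} & \text{if } t_c \leq t < t_\Lambda , \\
\exp\left(-t/t_\Lambda + 1\right) & \text{if } t_\Lambda \leq t .
\end{cases}
\tag{5.13}
\]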

To compute the radius of the diamond, we need to first find the edge time $t_{\rm edge}$
where the forward and backward lightcones meet. We will assume (and later check for
consistency) that this time is during the period of curvature domination. During
curvature domination, the radius of the forward lightcone is

\[
\chi_{\rm forward}(t) = 3 + \ln\frac{t}{t_c} ,
\tag{5.14}
\]

while that of the backward lightcone is given by $\chi_{\rm patch}$. Setting the two equal,
we find that

\[
t_{\rm edge} = e^{-1} \sqrt{t_c\, t_\Lambda} .
\tag{5.15}
\]

One can easily see that $t_c < t_{\rm edge} < t_\Lambda$ when $t_\Lambda \gg t_c$, which is the
limit we are most interested in. So we have, by Eq. (2.3),

\[
\chi_{\rm dia}(t) \propto \begin{cases}
3 \left(\frac{t}{t_c}\right)^{1/3} & \text{if } t < t_c , \\
3 + \ln\frac{t}{t_c} & \text{if } t_c \leq t < t_{\rm edge} , \\
1 + \ln\frac{t_\Lambda}{t} & \text{if } t_{\rm edge} \leq t < t_\Lambda , \\
\exp\left(-t/t_\Lambda + 1\right) & \text{if } t_\Lambda \leq t .
\end{cases}
\tag{5.16}
\]
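
The edge time of Eq. (5.15) can be cross-checked by solving $\chi_{\rm forward} = \chi_{\rm patch}$ numerically. The following is a minimal sketch; the function names, the bisection in $\log t$, and the sample values $t_c = 1$, $t_\Lambda = 1000$ are illustrative assumptions. A crossing inside the curvature era requires $t_\Lambda/t_c > e^2$.

```python
import math


def chi_forward(t, t_c):
    """Forward light-cone radius during curvature domination, Eq. (5.14)."""
    return 3.0 + math.log(t / t_c)


def chi_backward(t, t_lam):
    """Backward light-cone (patch) radius for t_c <= t < t_Lambda, Eq. (5.13)."""
    return 1.0 + math.log(t_lam / t)


def edge_time_bisect(t_c, t_lam, iterations=200):
    """Solve chi_forward = chi_backward on [t_c, t_lam] by bisection in log t."""
    lo, hi = math.log(t_c), math.log(t_lam)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        t = math.exp(mid)
        if chi_forward(t, t_c) < chi_backward(t, t_lam):
            lo = mid  # forward cone still smaller: the crossing lies later
        else:
            hi = mid
    return math.exp(0.5 * (lo + hi))


if __name__ == "__main__":
    t_c, t_lam = 1.0, 1000.0
    print(edge_time_bisect(t_c, t_lam), math.exp(-1.0) * math.sqrt(t_c * t_lam))
```

Both printed values should coincide, and the result lies between $t_c$ and $t_\Lambda$ as assumed in the derivation.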



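When computing the comoving volume, we must keep in mind that space is curved;
the comoving volume scales like $\pi \sinh(2\chi) - 2\pi\chi$. For large $\chi$, this reduces
to $\pi \exp(2\chi)/2$, while for small $\chi$ we recover $4\pi\chi^3/3$. As an
approximation, we will use the exponential form during the period of curvature domination
for both the patch and the diamond, as well as during matter domination for the patch. We
will use the flat form during matter domination for the diamond, and during $\Lambda$
domination for both the patch and the diamond. This is a good approximation in the regime
where the scales are widely separated. We find

\[
V_c^{\rm patch}(t) \propto \begin{cases}
\frac{\pi}{2} \left(\frac{t_\Lambda}{t_c}\right)^2 \exp\left(8 - 6\,(t/t_c)^{1/3}\right) & \text{if } t < t_c , \\
\frac{\pi e^2}{2} \left(\frac{t_\Lambda}{t}\right)^2 & \text{if } t_c \leq t < t_\Lambda , \\
\frac{4\pi}{3} \exp\left(-3t/t_\Lambda + 3\right) & \text{if } t_\Lambda \leq t
\end{cases}
\tag{5.17}
\]

for the patch and

\[
V_c^{\rm dia}(t) \propto \begin{cases}
36\pi\, (t/t_c) & \text{if } t \ll t_c , \\
\frac{\pi e^6}{2} \left(\frac{t}{t_c}\right)^2 & \text{if } t_c \leq t < t_{\rm edge} , \\
\frac{\pi e^2}{2} \left(\frac{t_\Lambda}{t}\right)^2 & \text{if } t_{\rm edge} \leq t < t_\Lambda , \\
\frac{4\pi}{3} \exp\left(-3t/t_\Lambda + 3\right) & \text{if } t_\Lambda \leq t
\end{cases}
\tag{5.18}
\]

for the diamond.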
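
The two limiting forms of the curved-space volume used above can be compared directly with the exact expression. This is a small illustrative sketch (function names are hypothetical; $\chi$ is measured in units of the curvature radius):

```python
import math


def comoving_volume_open(chi):
    """Volume within comoving radius chi in an open universe (unit curvature radius)."""
    return math.pi * math.sinh(2.0 * chi) - 2.0 * math.pi * chi


def volume_large_chi(chi):
    """Exponential approximation, valid for chi >> 1."""
    return 0.5 * math.pi * math.exp(2.0 * chi)


def volume_small_chi(chi):
    """Flat-space approximation, valid for chi << 1."""
    return 4.0 * math.pi * chi ** 3 / 3.0


if __name__ == "__main__":
    for chi in (0.01, 0.1, 1.0, 3.0, 5.0):
        exact = comoving_volume_open(chi)
        print(f"chi={chi:5.2f}  exact/flat={exact / volume_small_chi(chi):.4f}  "
              f"exact/exp={exact / volume_large_chi(chi):.4f}")
```

The flat form is accurate for $\chi \ll 1$ and the exponential form for $\chi$ of a few or more; near $\chi \sim 1$ neither is precise, which is why the approximation is trusted only when the scales are widely separated.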

We can count observers in the causal patch or diamond by integrating the comoving
volume against $\dot n_{\rm obs}(t)$, the rate at which observations are made per unit time
and comoving volume. Note that the scale factor in Eq. (5.12) contains an explicit dependence
on $t_c$ during matter domination. This means that the comoving observer density also
depends on $t_c$ through a trivial overall multiplicative factor. But the physical observer
density is independent of $t_c$ during matter domination. It will be clearer to work with
a variable that makes this explicit:

\[
\dot{\tilde n}_{\rm obs}(t) \equiv \dot n_{\rm obs}(t)/t_c .
\tag{5.19}
\]

For sufficiently large $\Delta N$ at fixed $\Lambda$, curvature never plays a dynamical role,
because $t_\Lambda < t_c$. In this regime the number of observers is independent of
$\Delta N$, and the probability distribution over $\Delta N$ is given by the prior:

\[
\frac{dp}{d\Delta N} \propto \frac{1}{(60 + \Delta N)^4} .
\tag{5.20}
\]

This can be seen in the 3rd plot of Figs. 2, 5, 8, 11, 14, and 17. In all figures, we
have used this analytic formula to continue our results beyond the displayed upper
limit of $\Delta N = 1$ when we calculate probabilities and confidence intervals. The (prior)
suppression of large $\Delta N$ means that there is no runaway in that direction.
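
The continuation beyond the plotted range only requires the integral of the prior, which has a closed form. As a worked step (using only Eq. (5.20); the symbol $\Delta N_{\max}$ for the upper end of the plotted range is introduced here for illustration, and the result is unnormalized):

```latex
\[
\int_{\Delta N_{\max}}^{\infty} \frac{d\Delta N}{(60+\Delta N)^4}
  = \frac{1}{3\,(60+\Delta N_{\max})^3},
\qquad
\left.\frac{1}{3\,(60+\Delta N_{\max})^3}\right|_{\Delta N_{\max}=1}
  = \frac{1}{3\cdot 61^3} \approx 1.47\times 10^{-6}.
\]
```

Because this tail is finite, the region of large $\Delta N$ carries a finite weight and can be added analytically to the numerically computed part of the distribution.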

Let us turn to the more interesting case where $t_\Lambda > t_c$. (In general, there would
be two subcases, $t_\Lambda > t_{\rm peak} > t_c$ and $t_\Lambda > t_c > t_{\rm peak}$.
However, in this subsection we are discussing only the variation of curvature, in a universe
otherwise like ours. For all our observer models, $t_{\rm peak}$ is comparable to
$t_\Lambda$ in our universe, so the second subcase does not arise.) For the causal diamond,
we find

\[
\frac{dp}{d\Delta N} \propto \frac{1}{(60 + \Delta N)^4}
\int dt\, \dot{\tilde n}_{\rm obs}(t)\, \frac{t^2}{t_c} .
\tag{5.21}
\]

The geometric factor of $t_c^{-1} \propto \exp(-3\Delta N)$, along with the prior
distribution, favors curvature.

However, for sufficiently large curvature, dynamical effects become important and
cut off the probability distribution. With all observer models we consider, the number
of observations will decrease if structure formation is suppressed. This effect becomes
severe if not even the earliest structures can form, i.e., if the density contrast,
$\sigma(M, t)$, never reaches unity even for the smallest mass scales that can cool
efficiently, $M_{\rm min}$. Let $t_{\rm vir}$ denote the time when these structures would have
formed in a universe without curvature. By Eq. (3.7), for $t_c \ll t_{\rm vir}$ these
structures will be suppressed like

\[
\exp\left[ -\left( \frac{1.68}{\sqrt{2}\, \sigma(M, t)} \right)^2 \right]
= \exp\left[ -B \left( \frac{t_{\rm vir}}{t_c} \right)^{4/3} \right] .
\tag{5.22}
\]

(Here $B$ is some order-one coefficient and $t_{\rm vir}$ depends weakly on the mass scale.)
This corresponds to a doubly-exponential suppression of the probability distribution over
$\Delta N$, Eq. (5.21). In our universe, the value of $t_c$ corresponding to $\Delta N = 0$ is
somewhat larger than $t_{\rm vir}$, and the suppressed regime is reached close to the lower
end of our parameter space, $\Delta N = -3.5$. This can be seen in the 3rd plot of Figs. 2,
5, and 8.

Now let us consider the same regime, $t_\Lambda > t_c$, in the causal patch. Using
Eq. (5.17) we find

\[
\frac{dp}{d\Delta N} \propto \frac{1}{(60 + \Delta N)^4}
\int dt\, \dot{\tilde n}_{\rm obs}(t)\, t_c \left( \frac{t_\Lambda}{t} \right)^2 .
\tag{5.23}
\]

This time, a geometric factor of $t_c$ appears in the numerator, suppressing curvature.
Below $\Delta N \approx -3.5$, the stronger, dynamical suppression discussed for the diamond
sets in: the failure of structure formation. This behavior is reflected in the 3rd plot of
Figs. 11, 14, and 17.

In all plots and all observer models, our failure thus far to detect curvature is
not surprising; it is perfectly compatible with the predicted probability distribution.
However, upcoming experiments such as Planck will be more sensitive to small amounts
of spatial curvature. In the spirit of Ref. [40], let us use our probability distribution to
calculate the probability that curvature will be detected in the future.

The current $1\sigma$ bound on curvature from WMAP5+BAO+SN [50] is
$\Omega_k = -0.0050^{+0.0061}_{-0.0060}$. This corresponds roughly to the Gaussian
distribution

\[
\frac{dp_{\rm exp}}{d\Omega_k} \propto
\exp\left( -\frac{(\Omega_k + 0.0050)^2}{2\,(0.0061)^2} \right) .
\tag{5.24}
\]

Our convention for $\Delta N = 0$ is the upper $1\sigma$ bound, $\Omega_k = 0.0011$. Since
$\Omega_k \propto \exp(-2\Delta N)$, we can convert our probability distribution for
$\Delta N$ into one for $\Omega_k$, which in the regime $\Delta N \gtrsim -1$ looks like

\[
\frac{dp}{d\Omega_k} \propto \frac{1}{\Omega_k}\,
\frac{1}{\left( 60 + \frac{1}{2} \ln\frac{0.0011}{\Omega_k} \right)^4} .
\tag{5.25}
\]

Because we are assuming an open universe, the probability for $\Omega_k < 0$ vanishes. The
current experimental bound is so strong that we do not need a more detailed form. If
future experiments reach a sensitivity level of $\Delta\Omega_k$, then we will be able to
detect the openness of the universe if $\Omega_k \geq \Delta\Omega_k$. The probability for
this to occur is

\[
p(\Omega_k \geq \Delta\Omega_k) \propto
\int_{\Delta\Omega_k}^{\infty} d\Omega_k\, \frac{dp}{d\Omega_k}\,
\exp\left( -\frac{(\Omega_k + 0.0050)^2}{2\,(0.0061)^2} \right) ,
\tag{5.26}
\]

which is normalized so that $p(\Omega_k \geq 0) = 1$. Then we find
$p(\Omega_k \geq 10^{-3}) \approx 0.033$, which might be realized in the near future, and
$p(\Omega_k \geq 10^{-4}) \approx 0.088$, which is roughly the limit of achievable sensitivity.
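
The order of magnitude of these detection probabilities can be reproduced from the approximate forms alone. The sketch below is a rough illustration under stated assumptions: it uses only the prior $\propto (60+\Delta N)^{-4}$ together with the Gaussian of Eq. (5.24), applies that prior at all $\Delta N$ (the Gaussian suppresses the region $\Delta N \lesssim -1$ where the form is not valid), and uses a hypothetical lower cutoff and function names. It is not the full numerical distribution, so only rough agreement with the quoted values should be expected.

```python
import math

OMEGA_REF = 0.0011             # Omega_k assigned to Delta N = 0 in the text
MEAN, SIGMA = -0.0050, 0.0061  # Gaussian fit to the experimental bound, Eq. (5.24)


def gaussian_factor(delta_n):
    """Experimental likelihood at Omega_k = OMEGA_REF * exp(-2 * delta_n)."""
    omega_k = OMEGA_REF * math.exp(-2.0 * delta_n)  # underflows to 0.0 for huge delta_n
    return math.exp(-((omega_k - MEAN) ** 2) / (2.0 * SIGMA ** 2))


def detection_probability(omega_threshold, delta_n_min=-5.0, steps=400_000):
    """Fraction of the weight in Eq. (5.26) with Omega_k >= omega_threshold.

    The substitution u = (60 + delta_n)**-3 turns the prior dN / (60 + dN)**4
    into du / 3, mapping the infinite range in delta_n onto a finite interval in u.
    """
    delta_n_threshold = 0.5 * math.log(OMEGA_REF / omega_threshold)
    u_max = (60.0 + delta_n_min) ** -3
    h = u_max / steps
    detected = total = 0.0
    for i in range(steps):
        u = (i + 0.5) * h                  # midpoint rule avoids the endpoint u = 0
        delta_n = u ** (-1.0 / 3.0) - 60.0
        weight = gaussian_factor(delta_n) * h
        total += weight
        if delta_n <= delta_n_threshold:   # small delta_n means large Omega_k
            detected += weight
    return detected / total


if __name__ == "__main__":
    for threshold in (1e-3, 1e-4):
        print(threshold, detection_probability(threshold))
```

With these inputs the two probabilities come out at a few percent and somewhat under ten percent, respectively, in the same range as the figures quoted above.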

5.3 Varying both the cosmological constant and curvature 

When both the curvature and the cosmological constant vary, we can use a similar
analysis to obtain a qualitative understanding of the probability distributions. Again,
we will distinguish different regimes determined by the relative sizes of $t_\Lambda$,
$t_c$, and $t_{\rm peak}$. We will now have to consider both positive and negative values of
the cosmological constant.

The cases with $t_\Lambda \ll t_c$ correspond to negligible curvature. In this regime, the
joint probability distribution is the direct product of the probability distribution over
$\Lambda$ (with negligible curvature, see Sec. 5.1) and the prior probability distribution
over $\Delta N$ (see Eq. (2.7)), which we have already examined. We can also immediately
dispense with the cases $t_\Lambda \ll t_{\rm peak}$: there will always be pressure toward
smaller $|\Lambda|$, either because of suppressed structure formation and a small causal
diamond/patch ($\Lambda > 0$) or because the universe has already collapsed ($\Lambda < 0$).

The case $t_c \ll t_{\rm peak} \ll t_\Lambda$, for $\Lambda > 0$, was essentially discussed
in the previous subsection, except that $\Lambda$ was held fixed there. But since
$t_{\rm peak}$ is essentially independent of $\Lambda$ and $\Delta N$, $\Lambda$ can vary only
in the direction of smaller values while preserving the above inequality. Therefore, in this
regime, the joint probability distribution is the direct product of the probability
distribution over $\Lambda$ given in Sec. 5.1 and the distribution over $\Delta N$ derived in
Sec. 5.2. Moreover, the doubly-exponential suppression of structure formation by curvature
is unaffected by the sign of $\Lambda$. Therefore, the joint distribution over $\Delta N$ and
negative values of $\Lambda$ is given by the direct product of the $\Delta N$ distribution
from Sec. 5.2 and the negative $\Lambda$ distribution from Sec. 5.1.

There is one subtlety, which is apparent when comparing the 3rd plot of Fig. 2 with
that of Fig. 3. In Fig. 3 the probability is increasing toward the boundary of the plot
at $\Delta N = -3.5$, whereas in Fig. 2 the suppression of structure formation has caused the
probability distribution to decrease at the same value of $\Delta N$. We have already argued
above that structure suppression due to large curvature works the same way for
$\Lambda < 0$ as for $\Lambda > 0$, so we must reconcile the apparent discrepancy. First, we
should note that the probability does not increase indefinitely toward small $\Delta N$; our
plot range is merely inadequate to show the eventual peak and subsequent decrease. We must
explain why the suppression does not happen at the same value of $\Delta N$ in the positive
and negative cases, for equal values of $|\Lambda|$.

The answer lies in the geometry of the causal diamond, specifically in the difference
in edge times, $t_{\rm edge}$, for positive and negative $\Lambda$. As we saw in Eq. (5.15),
$t_{\rm edge}$ will actually decrease as curvature is increased while $\Lambda$ remains
constant. It turns out that $t_{\rm edge} \approx t_{\rm peak}$ for $\Lambda = \Lambda_0$,
$Q = Q_0$, and $\Delta N = -3.5$. However, $t_{\rm edge}$ for $\Lambda < 0$ is always
of order $t_\Lambda$ (to be precise, it is equal to the time of maximum expansion). In
particular, $t_{\rm edge} > t_{\rm peak}$ for $Q = Q_0$, $\Lambda = -\Lambda_0$, and all values
of $\Delta N$. The entropy production curves are nearly identical when the sign of $\Lambda$
is flipped (since we are safely in the limit $t_\Lambda \gg t_{\rm peak}$), but in the
$\Lambda < 0$ case the tail of the entropy production lies entirely within the growing phase
of the causal diamond. The extra boost in probability granted by this effect means that more
curvature is required to suppress the probability for $\Lambda < 0$, even though structure
formation is suppressed by an amount independent of the sign of $\Lambda$.

Let us turn to the case

\[
t_{\rm peak} \ll t_c \ll t_\Lambda , \qquad \Lambda > 0 .
\tag{5.28}
\]

Structure formation is uninhibited in this regime, so only prior probabilities and
geometric effects control the joint probability distribution. The comoving volumes of the
patch and diamond are given in Eqs. (5.17) and (5.18). Combining this with the prior
distribution, we find for the causal diamond:

\[
\frac{d^2 p}{d\log_{10}\Lambda\, d\Delta N} \propto
\frac{\Lambda}{(60 + \Delta N)^4} \int_0^\infty dt\, \dot{\tilde n}_{\rm obs}(t)\, t .
\tag{5.29}
\]

The integral is independent of $\Lambda$ and $\Delta N$, so the probability distribution is
governed by the prefactor. The pressure toward larger $\Lambda$ and smaller $\Delta N$
suppresses any hierarchy between the timescales appearing in the above double inequality.
Indeed, in our universe, the inequality is either violated or approximately saturated.

In the same regime, Eq. (5.28), the causal patch introduces the geometric factor
$t_\Lambda^2/t_c$. This leads to a somewhat different result:

\begin{align}
\frac{d^2 p}{d\log_{10}\Lambda\, d\Delta N}
&\propto \frac{\Lambda}{(60 + \Delta N)^4}\, \frac{t_\Lambda^2}{t_c}
\int_0^\infty dt\, \dot{\tilde n}_{\rm obs}(t) \tag{5.30} \\
&\propto \frac{\exp(-3\Delta N)}{(60 + \Delta N)^4}
\int_0^\infty dt\, \dot{\tilde n}_{\rm obs}(t) . \tag{5.31}
\end{align}

There is a pressure toward smaller $\Delta N$, suppressing a large hierarchy
$t_{\rm peak} \ll t_c$. This is qualitatively the same as for the diamond, though the pressure
is stronger. But unlike in the diamond case, small values of the cosmological constant are
unsuppressed: at fixed $\Delta N$, the probability is flat in $\log_{10}\Lambda$. This is a
runaway problem: without a lower bound on $\log_{10}\Lambda$, the probability distribution is
not integrable in the direction of small $\log_{10}\Lambda$. Such a lower bound may well be
supplied by the finiteness of the landscape. It need not be particularly close to the
observed value, since the probability is flat. (We are about to discover, however, that a
more severe runaway problem arises for the causal patch in the analogous case with negative
$\Lambda$.)

Now we analyze the same case, Eq. (5.28), but with $\Lambda < 0$. Our conclusions for
the causal diamond remain the same, with $\Lambda$ replaced by $|\Lambda|$, since neither
$\dot n_{\rm obs}(t)$ nor $V_c^{\rm dia}(t)$ depend on the sign of $\Lambda$ when
$t \ll t_c \ll t_\Lambda$. Turning to the causal patch, we note that the scale factor can be
approximated by

\[
a(t) = \begin{cases}
t^{2/3}\, t_c^{1/3} & \text{if } t < t_c , \\
t_\Lambda \sin\left( \frac{t + t_1}{t_\Lambda} \right) & \text{if } t_c \leq t < t_2 - t_c , \\
(t_2 - t)^{2/3}\, t_c^{1/3} & \text{if } t_2 - t_c \leq t < t_2
\end{cases}
\tag{5.32}
\]

in the simplified sudden transition approximation, where

\[
t_1 = t_\Lambda \sin^{-1}\left( \frac{t_c}{t_\Lambda} \right) - t_c
\tag{5.33}
\]

and

\[
t_2 = \pi t_\Lambda - 2 t_1 .
\tag{5.34}
\]

Since $t_{\rm peak} \ll t_c$, we need only compute the comoving volume in the regime
$t \ll t_c$. For the comoving radius, we find

\begin{align}
\chi_{\rm patch}(t) &= \int_t^{t_c} \frac{dt'}{t'^{2/3}\, t_c^{1/3}}
+ \int_{t_c}^{t_2 - t_c} \frac{dt'}{t_\Lambda \sin\left( (t' + t_1)/t_\Lambda \right)} \tag{5.35} \\
&\quad + \int_{t_2 - t_c}^{t_2} \frac{dt'}{(t_2 - t')^{2/3}\, t_c^{1/3}} \tag{5.36} \\
&= 3 - 3 \left( \frac{t}{t_c} \right)^{1/3}
+ \ln \frac{\tan\left( t_2/2t_\Lambda - t_c/2t_\Lambda + t_1/2t_\Lambda \right)}
{\tan\left( t_c/2t_\Lambda + t_1/2t_\Lambda \right)} + 3 \tag{5.37} \\
&\approx 6 + 2 \ln \frac{t_\Lambda}{t_c} . \tag{5.38}
\end{align}

(In the final line we used $t \ll t_c \ll t_\Lambda$.) Using
$V(\chi) \approx \frac{\pi}{2} \exp(2\chi)$, the comoving volume is approximately

\[
V_c^{\rm patch}(t) \approx \frac{\pi e^{12}}{2} \left( \frac{t_\Lambda}{t_c} \right)^4 ,
\tag{5.39}
\]

yielding the probability distribution

\[
\frac{d^2 p}{d\log_{10}|\Lambda|\, d\Delta N} \propto
\frac{\exp(-9\Delta N)}{|\Lambda|\, (60 + \Delta N)^4} .
\tag{5.40}
\]

Again we find the causal patch leads to runaway towards small values of $|\Lambda|$ in the
presence of curvature. The runaway is stronger, like $|\Lambda|^{-1}$ for fixed $\Delta N$,
than the flat distribution over $\log_{10}|\Lambda|$ that we found in the case of positive
$\Lambda$. The preference for small $|\Lambda|$ is evident in the 8th plot of Figs. 12, 15,
and 18, where the probability is concentrated in the small $\Delta N$ and small $\Lambda$
corner of the plots.

This runaway implies one of two things. Either the causal patch is the wrong measure,
at least in regions with nonpositive cosmological constant,$^{9}$ or our vacuum has
the smallest cosmological constant of all (anthropic) vacua in the landscape. The latter
possibility is quite interesting, since it offers a potential first-principles explanation of
the actual magnitude of $|\Lambda|$ (and by correlation, many other unexplained hierarchies)
in terms of a fundamental property of the underlying theory: the size of the landscape.
(A variant of this possibility is that the landscape does contain vacua with smaller
$\Lambda$ but the prior probability distribution for $\Delta N$ is weighted towards large
values, so that curvature is negligible in all pocket universes.)
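
To see how differently the two runaways behave, one can integrate each small-$|\Lambda|$ form down to a lower cutoff at fixed $\Delta N$. The following worked step is an illustration only; the cutoffs $\Lambda_{\min}$ and $\Lambda_{\max}$ are hypothetical symbols standing for the ends of whatever range the landscape supplies.

```latex
\[
\int_{\log_{10}\Lambda_{\min}}^{\log_{10}\Lambda_{\max}} d\log_{10}\Lambda \;\cdot\; \mathrm{const}
  \;\propto\; \log_{10}\frac{\Lambda_{\max}}{\Lambda_{\min}},
\qquad
\int_{\log_{10}|\Lambda|_{\min}}^{\log_{10}|\Lambda|_{\max}} \frac{d\log_{10}|\Lambda|}{|\Lambda|}
  \;=\; \frac{1}{\ln 10}\left(\frac{1}{|\Lambda|_{\min}} - \frac{1}{|\Lambda|_{\max}}\right).
\]
```

For positive $\Lambda$ the total weight grows only logarithmically as the cutoff is lowered, whereas for negative $\Lambda$ it grows like $1/|\Lambda|_{\min}$, so essentially all of the probability sits at the smallest available $|\Lambda|$.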

5.4 Varying the density contrast only 

Unlike $\Lambda$ and $\Delta N$, $Q$ has no geometric effects (at least in the homogeneous
approximation we employ). It does have dynamical implications, however. The effects of $Q$
on the observer production rate $\dot n_{\rm obs}$ differ somewhat depending on the observer
model. Since all three models involve star formation, let us begin by considering how it is
affected by $Q$. $Q$ enters the structure formation equations in the combination $Q\,G(t)$,
and $G(t) \propto t^{2/3}$ during matter domination. Thus, the time of structure formation
(and thus, star formation) scales as $t_{\rm vir} \propto Q^{-3/2}$.
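
The scaling can be made explicit in one line. As a worked step (assuming only that structure of a given mass scale forms when the combination $Q\,G(t)$ reaches a fixed threshold):

```latex
\[
Q\,G(t_{\rm vir}) \simeq \mathrm{const}, \qquad G(t) \propto t^{2/3}
\;\;\Longrightarrow\;\;
Q\,t_{\rm vir}^{2/3} \simeq \mathrm{const}
\;\;\Longrightarrow\;\;
t_{\rm vir} \propto Q^{-3/2}.
\]
```

For example, doubling $Q$ moves structure formation earlier by a factor of $2^{-3/2} \approx 0.35$.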

Let us discuss first the effects of increasing $Q$. Star formation that happens earlier
also happens faster. The star formation rate for a single halo, Eq. (3.8), scales like
the inverse of the gravitational timescale of the halo. Since $t_{\rm grav}$ is proportional
to the virialization time of the halo, the star formation rate scales as
$\dot\rho_\star(t_{\rm peak}) \propto Q^{3/2}$. By similar arguments, the width of the star
formation rate also scales like $\Delta t \propto Q^{-3/2}$. Thus, one might expect that the
total integrated star formation rate stays constant as one varies $Q$. However, there is an
effect which is not seen by this rough analysis. As $Q$ increases, the range of halo masses
which are able to cool and form stars also increases. The lower mass limit, which comes from
the requirement that halos be hotter than $10^4$ K to start, is smaller at earlier times.
And the upper mass barrier, which arises from radiative cooling failure, completely
disappears at sufficiently early times (prior to about 1 Gyr), because Compton cooling
becomes efficient [51].$^{10}$ Thus, a larger mass fraction forms stars. Numerically, we find
that these two effects combine to make the integrated star formation grow roughly
logarithmically with $Q$:

\[
\int dt\, \dot\rho_\star(t) \propto \log_{10} Q .
\tag{5.41}
\]

$^{9}$ These regions were defined more sharply in Ref. [19,21] ("hat domains" and "singular
domains"). If the causal patch is viewed as arising from an analogy with the AdS/CFT
correspondence via a global/local duality, it is not implausible that it may apply only in
the "eternal domain" (roughly, regions with positive $\Lambda$).

$^{10}$ Dominant Compton cooling, however, corresponds to a drastic change of regime and may
be catastrophic. This possibility is explored in Ref. [25].

In the entire analysis, so far, we have assumed that structure formation is not disrupted 
by curvature or vacuum energy; this assumption is certainly justified in the case at hand, 
where A and AA^ are held fixed and set to the observed values. 
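
The rough counting argument of this subsection (peak rate times width) can be written out explicitly. The factor $f_\star(Q)$, standing for the mass fraction in halos that are able to cool, is a symbol introduced here purely for illustration:

```latex
\begin{align}
  \dot\rho_\star(t_{\rm peak}) &\propto t_{\rm grav}^{-1} \propto t_{\rm vir}^{-1}
    \propto Q^{3/2} , \qquad \Delta t \propto Q^{-3/2} , \\
  \int dt\, \dot\rho_\star(t) &\sim f_\star(Q)\, \dot\rho_\star(t_{\rm peak})\, \Delta t
    \propto f_\star(Q)\, Q^{3/2}\, Q^{-3/2} = f_\star(Q) .
\end{align}
```

In this sketch the powers of $Q$ from the peak height and the width cancel, so any residual $Q$-dependence of the integrated star formation must come from $f_\star(Q)$. The roughly logarithmic growth of Eq. (5.41) is a numerical result and is not derived by this estimate.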

There is a limit to how far we can increase Q given the model of star formation 
outlined in Sec. 3.1: for large enough Q, structure formation happens before recom- 
bination. Even though dark matter halos can form prior to recombination, baryons 
cannot collapse into galaxies. One expects that if there are large dark matter halos 
already formed at the time of recombination, then there will be a huge surge in star 
formation as the baryons fall into them after decoupling from the photons. Our star 
formation code takes this into account in a very simplistic way: star formation which 
would have happened prior to recombination is delayed until after recombination, and 
at that time there is a large and essentially instantaneous spike in the star formation 
rate, after which it drops back down to normal levels. (The code knows nothing of the 
actual interactions between baryons and photons.) There may be a phenomenological 
reason why such instantaneous star formation is not hospitable for observers, but at 
the very least this is a change in regime for the star formation rate. At $Q = 10^2 Q_0$, 32%
of the total star formation is created in this spike, up from 13% at and 0.1% 

at lO^-^Qo, and this percentage will continue to rise if Q is increased. This motivates 
our cut-off at $10^2 Q_0$. As seen in Sec. 4, the behavior of the probability distribution in
$Q$ near the upper boundary is fairly mild (at worst a logarithmic growth when $\Lambda = \Lambda_0$
and $\Delta N > 0$ are fixed), so our results should not change dramatically if the cut-off is
extended.

If $Q$ is decreased compared to the observed value, then the range of halo masses
that can cool efficiently shrinks. It soon disappears altogether, for $Q \approx 10^{-1} Q_0$. There
are no stars, and no observers, for smaller values of $Q$. This cuts off the probability
distribution over $Q$ independently of $\Delta N$, $\Lambda$, the measure, and the observer model, so
we will not discuss this case further.

Let us now estimate the probability distribution for $Q > Q_0$. We begin with the
time delay observer models. The time delay is held fixed as $Q$ varies. (The assumption
underlying these models is that $t_{\rm delay}$ is determined, at least in substantial part, by
dynamics unrelated to $Q$, $\Delta N$, or $\Lambda$.) In our universe, $t_{\rm vir}$ is already somewhat smaller
than 5 Gyr, and for larger $Q$, $t_{\rm vir}$ will be entirely negligible compared to the time
delay, as will the width of the star formation rate. Thus, observers will live at a time
given approximately by $t_{\rm delay}$. Using a flat prior, $d\tilde{p}/d\log_{10} Q \sim 1$, and the
logarithmic growth of the integrated SFR described above, one expects

$$\frac{dp}{d\log_{10} Q} \propto \int dt\, \dot\rho_\star(t - t_{\rm delay})\, V_c(t_{\rm delay}) \qquad (5.42)$$

$$\propto \log_{10} Q . \qquad (5.43)$$

This distribution holds in both the patch and the diamond and with either a 5 or 10 Gyr
time delay, as is evident in the 2nd plot of Figs. 5, 8, 14, and 17.

There are additional complications with entropy production, and these complica- 
tions are such that we are only able to give a very rough qualitative account of the 
probability distribution; numerical calculations are essential to finding the true shape. 
The first complication is that stars burn for a long time after they are created. This 
makes the entropy production rate significantly broader than the star formation rate, 
and the result is that we cannot reliably approximate all of the entropy as being created 
at a single time. 

A second complication is that earlier star formation means earlier entropy production,
because much of the entropy is produced by fast burning massive stars. The peak
of the entropy production rate can happen at arbitrarily early times, unlike in the time
delay models, which ensured that the peak of $\dot{n}_{\rm obs}(t)$ happens after the time
$t_{\rm delay}$. This has an important consequence in the comparison of the patch and the
diamond. As $Q$ is increased, more entropy production happens at earlier times, when the
diamond is small but the patch is large. Indeed, comparing the 2nd plot of Figs. 2 and 11,
we see that for the causal diamond, $dp/d\log_{10} Q$ is maximal at $Q = Q_0$, whereas for the
causal patch, it increases monotonically with $\log_{10} Q$.

The third complication is the effect of the dust temperature. The interstellar dust
temperature is a complicated function of both the halo virialization time and the actual
emission time (Eq. (3.16)). However, we can say qualitatively that the effect of variation
in dust temperature is to suppress the entropy production of early-time stars relative
to late-time stars. This is why, for example, in the 2nd plot of Fig. 11, which uses
the causal patch measure, the probability distribution begins to flatten out for large $Q$
rather than continuing to increase in a manner similar to that of the time delay model
in Figs. 5 and 8.

5.5 Varying the density contrast and the cosmological constant 

When both $\Lambda$ and $Q$ are allowed to vary, we can combine the analysis of Sec. 5.1 and
the previous subsection to understand the probability distribution. In Sec. 5.1, we
concluded that the most likely value of $\Lambda$ for fixed $Q$ was determined by the condition
$t_\Lambda \sim t_{\rm peak}$, where $t_{\rm peak}$ is the peak time of the model-dependent
$\dot{n}_{\rm obs}(t)$. In the previous subsection we found that, depending on the observer
model, $t_{\rm peak}$ can depend on $Q$.

In the time delay observer model, $t_{\rm peak} \approx t_{\rm delay}$ for all relevant values of $Q$.
Indeed, in the 9th plots of Figs. 5, 8, 14, and 17, we see that the most likely value of $\Lambda$ is
essentially independent of $Q$. Additionally, the probability increases with $Q$ proportional to
$\log_{10} Q$, due to the increase of total star formation discussed in Sec. 5.4, and this effect
is visible in the same plots. The only difference between the diamond and the patch in
these models is in the broadness of the distribution in the $\Lambda$ direction, which was also
discussed in Sec. 5.1.

In the entropy production observer model, $t_{\rm peak}$ depends strongly on $Q$:
$t_{\rm peak} \sim Q^{-3/2}$. This leads to the relation $\Lambda \propto Q^3$ for the most likely
value of $\Lambda$ at a given $Q$. In the 9th plot of Figs. 2, 3, 11, and 12 we see this trend in
the slope of the contour lines toward large $\Lambda$ and large $Q$. This looks like a runaway, but
there is a cut-off ($Q \sim 10^2 Q_0$) coming from our requirement that star formation happen after
recombination (see Sec. 5.4).
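
The slope of this trend follows from combining the peak condition of Sec. 5.1 with the structure formation scaling of Sec. 5.4. The sketch below assumes that the timescale set by the cosmological constant scales as $t_\Lambda \propto |\Lambda|^{-1/2}$ and that the peak of entropy production tracks the virialization time; both are stated here as working assumptions, not as results quoted from the text:

```latex
\begin{align}
  t_\Lambda &\propto |\Lambda|^{-1/2} , \qquad
  t_{\rm peak} \propto t_{\rm vir} \propto Q^{-3/2} , \\
  t_\Lambda \sim t_{\rm peak}
  &\quad \Rightarrow \quad |\Lambda|^{-1/2} \propto Q^{-3/2}
  \quad \Rightarrow \quad |\Lambda| \propto Q^{3} .
\end{align}
```

On a plot of $\log_{10}|\Lambda|$ against $\log_{10} Q$ this corresponds to contours tilted with slope 3: under these assumptions, a tenfold increase in $Q$ raises the preferred $|\Lambda|$ by a factor of $10^3$.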

5.6 Varying the density contrast and spatial curvature 

If curvature is small, $t_c \gg t_{\rm vir}$, curvature has neither dynamical nor geometric effects
for $\Lambda = \Lambda_0$. Large curvature, however, can disrupt structure formation and thus star
formation. This is encoded in Eq. (5.22), which is valid for $t_c \ll t_{\rm vir}$ (and was derived
from Eq. (3.7)). When $Q$ varies simultaneously with $\Delta N$, we must take into account
the $Q$-dependence of $t_{\rm vir}$, $t_{\rm vir} \propto Q^{-3/2}$:



(Recall that $B$ is an order-one coefficient; $C$ is a different order-one coefficient that
weakly depends on the mass scale.) For increased $Q$, a smaller $t_c$ is required to halt
structure formation. This yields the following relation between $\log_{10} Q$ and
$\Delta N_{\rm crit}$, the value of $\Delta N$ at which structure formation is completely suppressed:

$$(2 \log_{10} e)\, \Delta N_{\rm crit} + \log_{10} Q = \text{const.} \qquad (5.45)$$

We found in Sec. 5.2 that $\Delta N_{\rm crit} \approx -3.5$ when $Q = Q_0$, which is already at the edge of
the parameter range we consider. For larger values of $Q$, the value of $\Delta N$ necessary to
significantly suppress structure formation is outside that range. However, the highest
value of $Q$ we consider, $10^2 Q_0$, shifts $\Delta N_{\rm crit}$ only by $-2.3$.

The 7th plot of each figure shows how $\Delta N_{\rm crit}$ depends on $Q$ for fixed $\Lambda = \Lambda_0$. In
the time delay models, Figs. 5, 8, 14, and 17, one clearly sees the contours of constant
probability following a slope given approximately by Eq. (5.45). The corresponding
plots in the entropy production models, Figs. 2 and 11, look a bit different in their gross
features, owing to the complications discussed in Sec. 5.4, but the same trend in
$\Delta N_{\rm crit}$ is visible.
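
The quoted shift of $-2.3$ can be checked directly from Eq. (5.45), which reads $(2\log_{10} e)\,\Delta N_{\rm crit} + \log_{10} Q = \text{const}$. The following is a minimal sketch of that arithmetic; the function and parameter names are hypothetical and are not taken from the star formation code used for the figures. It assumes only Eq. (5.45) and the value $\Delta N_{\rm crit} \approx -3.5$ at $Q = Q_0$ quoted in the text.

```python
import math

# Illustrative sketch only: names below are hypothetical, not from the paper's code.


def delta_n_crit_shift(log10_q_over_q0: float) -> float:
    """Shift of Delta N_crit relative to its value at Q = Q_0.

    Follows from Eq. (5.45): (2 log10 e) * Delta N_crit + log10 Q = const,
    so a change in log10 Q is compensated by -log10 Q / (2 log10 e).
    """
    return -log10_q_over_q0 / (2.0 * math.log10(math.e))


def delta_n_crit(log10_q_over_q0: float, delta_n_crit_at_q0: float = -3.5) -> float:
    """Delta N_crit at a given Q, anchored to the value quoted at Q = Q_0."""
    return delta_n_crit_at_q0 + delta_n_crit_shift(log10_q_over_q0)


if __name__ == "__main__":
    # Scan a few values of log10(Q / Q_0) up to the cut-off at Q = 10^2 Q_0.
    for x in (0.5, 1.0, 2.0):
        print(f"log10(Q/Q0) = {x:.1f}: "
              f"shift = {delta_n_crit_shift(x):+.2f}, "
              f"Delta N_crit = {delta_n_crit(x):+.2f}")
```

At the upper cut-off, $\log_{10}(Q/Q_0) = 2$, the shift evaluates to $-\ln 10 \approx -2.30$, in agreement with the value stated above.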

All of the above considerations hold also for $\Lambda = -\Lambda_0$ for the causal diamond,
which can be seen in the 7th plot of Figs. 3, 6, and 9. (The causal patch, of course,
has a runaway problem which is dominant. See Sec. 5.3.) Since $t_{\rm edge}$ is larger for
$\Lambda < 0$ for equal $|\Lambda|$ (see Sec. 5.1), the geometry of the diamond allows $\Lambda < 0$ to
tolerate more curvature. This means $\Delta N_{\rm crit}$ is smaller for $\Lambda < 0$, but does not change
the fact that it scales according to Eq. (5.45). It is this geometric effect which also leads to
the enhancement of probability for small $\Delta N$ visible in the 3rd and 6th plots of those
same figures. As our discussion has made clear, this does not indicate a new runaway
problem; our displayed parameter range is merely too small to see the suppression at
small $\Delta N$ from structure suppression.

5.7 Marginalizing 

Let us combine our previous observations to understand the behavior of the probability 
distributions in which one or more parameters have been integrated out. 

Integrating out $\Delta N$. For the causal diamond, the only consequence of varying $\Delta N$
is that for small $\Delta N$ structure formation is interrupted. This only happens over a very
small part of the full $\Delta N$ parameter space, so integrating out $\Delta N$ does not have a large
effect on the distributions for $Q$ and $\Lambda$, independently of the observer model. This is
evident in the similarities between the 9th and 12th plots in Figs. 2, 3, 5, 6, 8, and 9.

For the causal patch, however, the runaway toward small $|\Lambda|$ discussed in Sec. 5.3
means that the smallest values of $|\Lambda|$ are greatly favored after $\Delta N$ is integrated out.
Restricting to $\Lambda > 0$, the effect is weaker, but can still be seen in the comparison of the
9th and 12th plots of Figs. 14 and 17. For $\Lambda < 0$, it is stronger, as seen in comparing
the 9th and 12th plots in Figs. 12, 15, and 18.

Integrating out $Q$. The effect of integrating out $Q$ depends on the observer model.
In the time delay models, there is very little change in the distributions of $\Lambda$ or $\Delta N$
after integrating out $Q$, as one can see by comparing the 8th and 11th plots in each
of Figs. 5, 6, 8, and 9. For $\Lambda$, this is easily understood since most of the $Q$ parameter
space has $t_{\rm peak} \approx t_{\rm delay}$, and thus the analysis is largely independent of $Q$ (see Sec. 5.4).
For curvature, integrating out $Q$ should increase the relative probability for small $\Delta N$
compared with $Q = Q_0$ due to relaxed restrictions on structure formation. Comparing
the 3rd and 6th plots of those same figures, we see that small $\Delta N$ is relatively more
favored in the 6th plot.

For entropy production, though, there are some more significant effects. After
integrating out $Q$, larger values of both $|\Lambda|$ and curvature are generally favored. There
are two reasons for this. The first is that the prior probabilities for $|\Lambda|$ and curvature
favor large values of both. With $Q$ fixed, this tendency is countered
by the suppression of structure formation when $\Lambda$ or curvature dominate before $t_{\rm peak}$.
But once $Q$ is integrated out, this restriction is relaxed. The second reason is that the
large $Q$ region of parameter space, which is where $\Lambda$ and curvature are allowed to be
large, has more star formation and hence more entropy per comoving volume. So the
large $Q$ region of parameter space contributes more to the resulting distributions of $\Lambda$
and $\Delta N$ than the small $Q$ region after $Q$ is integrated out.

It is difficult to find an analytical formula for the resulting probability distributions
due to the complications mentioned in Sec. 5.4. But, qualitatively, we can see this shift
toward larger $|\Lambda|$ and smaller $\Delta N$ in comparing the 8th and 11th plots in each of
Figs. 2, 3, and 11 (Fig. 12 is not included on this list because the runaway problem
toward small $|\Lambda|$ is the dominant feature for $\Lambda < 0$ in the causal patch). The same
figures also show a preference for large $|\Lambda|$ in the 4th plot. The probability flattens or
turns around for the largest values of $|\Lambda|$ because of the upper limit we have imposed
on $Q$.

Integrating out $\Lambda$. When integrating out $\Lambda$, the most important distinction is whether
we use the diamond or the patch. For the causal patch, as discussed in Sec. 5.3, there
is a runaway problem toward small $|\Lambda|$. The probability is concentrated in the region of
smallest $|\Lambda|$. It is possible that this value is not much smaller than the observed value,
so this problem is not necessarily fatal. When computing confidence intervals requires
the assumption of a lower bound on $|\Lambda|$ (which is always highlighted in the captions),
we use the lower end of the displayed range, $10^{-3} \Lambda_0$.

This issue does not arise for the causal diamond. In the time delay models there is
almost no difference between leaving $\Lambda$ fixed and integrating it out in both the positive
and negative cases. We can see this by comparing the 7th and 10th plots of Figs. 5, 6,
8, and 9. The cases where $\Lambda$ is integrated over both positive and negative values are
covered in Figs. 7 and 10. They are almost identical to the negative $\Lambda$ case because
the $\Lambda < 0$ vacua contribute the bulk of the probability.

With entropy production there is a qualitative change in the distribution of $Q$ when
going from $\Lambda$ fixed to $\Lambda$ integrated out. This change is most clearly seen in comparing
the 2nd and 5th plots of Figs. 2 and 3. When $\Lambda$ is fixed, the $Q$ distribution becomes
flat for large $Q$, and also has a peak at small $Q$ ($Q \approx Q_0$). Recall from Sec. 5.4 that the
flatness for large $Q$ is attributable to the changing dust temperature, and the shape
of the diamond gives extra suppression for large $Q$ that effectively creates the peak
at small $Q$. After integrating out $\Lambda$, both of these effects disappear. (In the plots
referenced above, it looks like large $Q$ is actually still suppressed, but this is due to a
new effect: the finiteness of our parameter range in $\Lambda$.)

These effects disappear because, at fixed $\Lambda$ and high $Q$, the diamond
is sensitive to the long tail of the entropy production rate; its maximal comoving
volume is several Gyr after $t_{\rm peak}$. Integrating over $\Lambda$ is effectively like finding the
optimal value of $\Lambda$ for each $Q$. The optimal value for $\Lambda$ is the one where the maximal
height diamond volume is centered on the peak of the entropy production rate, which
means that the resulting $Q$ distribution is insensitive to the tail of the entropy production rate.

Acknowledgments 

We are grateful to B. Freivogel, R. Harnik, and J. Niemeyer for discussions. This work 
was supported by the Berkeley Center for Theoretical Physics, by a CAREER grant 
(award number 0349351) of the National Science Foundation, by FQXi grant RFP2-08-06,
and by the US Department of Energy under Contract DE-AC02-05CH11231. 

References 

[1] R. Bousso and J. Polchinski, "Quantization of four-form fluxes and dynamical
neutralization of the cosmological constant," JHEP 06 (2000) 006, hep-th/0004134.

[2] S. Kachru, R. Kallosh, A. Linde, and S. P. Trivedi, "De Sitter vacua in string theory,"
Phys. Rev. D 68 (2003) 046005, hep-th/0301240.

[3] A. N. Schellekens, "The landscape 'avant la lettre'," arXiv:physics/0604134.

[4] S. Weinberg, "Anthropic bound on the cosmological constant," Phys. Rev. Lett. 59
(1987) 2607.

[5] J. Polchinski, "The cosmological constant and the string landscape," hep-th/0603249.

[6] R. Bousso, "TASI Lectures on the Cosmological Constant," Gen. Rel. Grav. 40 (2008)
607-637, arXiv:0708.4231 [hep-th].

[7] R. Bousso and S. Leichenauer, "Star formation in the multiverse," Phys. Rev. D 79
(2009) 063506, arXiv:0810.3044 [astro-ph].

[8] A. Linde, D. Linde, and A. Mezhlumian, "Nonperturbative amplifications of
inhomogeneities in a self-reproducing universe," Phys. Rev. D 54 (1996) 2504-2518,
gr-qc/9601005.

[9] A. H. Guth, "Inflation and eternal inflation," Phys. Rep. 333 (2000) 555,
astro-ph/0002156.




[10] A. H. Guth, "Inflationary models and connections to particle physics,"
astro-ph/0002188.

[11] A. H. Guth, "Inflation," astro-ph/0404546.

[12] M. Tegmark, "What does inflation really predict?," JCAP 0504 (2005) 001,
astro-ph/0410281.

[13] R. Bousso, B. Freivogel, and I.-S. Yang, "Boltzmann babies in the proper time
measure," Phys. Rev. D 77 (2008) 103514, arXiv:0712.3324 [hep-th].

[14] R. Bousso and B. Freivogel, "A paradox in the global description of the multiverse,"
JHEP 06 (2007) 018, hep-th/0610132.

[15] D. N. Page, "Is our universe likely to decay within 20 billion years?," hep-th/0610079.

[16] D. N. Page, "Return of the Boltzmann brains," hep-th/0611158.

[17] A. Linde, "Towards a gauge invariant volume-weighted probability measure for eternal
inflation," JCAP 0706 (2007) 017, arXiv:0705.1160 [hep-th].

[18] A. H. Guth, "Eternal inflation and its implications," J. Phys. A 40 (2007) 6811-6826,
hep-th/0702178.

[19] R. Bousso, "Complementarity in the Multiverse," arXiv:0901.4806 [hep-th].

[20] R. Bousso, B. Freivogel, and I.-S. Yang, "Properties of the scale factor measure,"
arXiv:0808.3770 [hep-th].

[21] R. Bousso and I.-S. Yang, "Global-Local Duality in Eternal Inflation,"
arXiv:0904.2386 [hep-th].

[22] R. Bousso, "Holographic probabilities in eternal inflation," Phys. Rev. Lett. 97 (2006)
191302, hep-th/0605263.

[23] R. Bousso, R. Harnik, G. D. Kribs, and G. Perez, "Predicting the cosmological constant
from the causal entropic principle," Phys. Rev. D 76 (2007) 043513, hep-th/0702115.

[24] B. Freivogel, "Anthropic Explanation of the Dark Matter Abundance,"
arXiv:0810.0703 [hep-th].

[25] R. Bousso, L. J. Hall, and Y. Nomura, "Multiverse Understanding of Cosmological
Coincidences," arXiv:0902.2263 [hep-th].

[26] J. Garriga and A. Vilenkin, "Holographic Multiverse," JCAP 0901 (2009) 021,
arXiv:0809.4257 [hep-th].




[27] R. Bousso, B. Freivogel, and M. Lippert, "Probabilities in the landscape: The decay of
nearly flat space," Phys. Rev. D 74 (2006) 046008, hep-th/0603105.

[28] L. Mersini-Houghton and F. C. Adams, "Limitations of anthropic predictions for the
cosmological constant $\Lambda$: Cosmic Heat Death of Anthropic Observers," Class. Quant.
Grav. 25 (2008) 165002, arXiv:0810.4914.

[29] F. Denef and M. R. Douglas, "Distributions of flux vacua," JHEP 05 (2004) 072,
hep-th/0404116.

[30] J. M. Cline, A. R. Frey, and G. Holder, "Predictions of the causal entropic principle for
environmental conditions of the universe," arXiv:0709.4443 [hep-th].

[31] B. Bozek, A. J. Albrecht, and D. Phillips, "Curvature Constraints from the Causal
Entropic Principle," arXiv:0902.1171 [astro-ph.CO].

[32] D. Phillips and A. Albrecht, "Effects of Inhomogeneity on the Causal Entropic
prediction of Lambda," arXiv:0903.1622 [gr-qc].

[33] L. Hernquist and V. Springel, "An analytical model for the history of cosmic star
formation," Mon. Not. Roy. Astron. Soc. 341 (2003) 1253, arXiv:astro-ph/0209183.

[34] M. P. Salem, "Negative vacuum energy densities and the causal diamond measure,"
arXiv:0902.4485 [hep-th].

[35] L. Susskind, L. Thorlacius, and J. Uglum, "The stretched horizon and black hole
complementarity," Phys. Rev. D 48 (1993) 3743, hep-th/9306069.

[36] R. Bousso, B. Freivogel, and I.-S. Yang, "Eternal inflation: The inside story," Phys.
Rev. D 74 (2006) 103516, hep-th/0606114.

[37] J. B. Hartle and M. Srednicki, "Are we typical?," Phys. Rev. D 75 (2007) 123523,
arXiv:0704.2630 [hep-th].

[38] J. Garriga and A. Vilenkin, "Prediction and explanation in the multiverse,"
arXiv:0711.2559 [hep-th].

[39] D. N. Page, "The Born Rule Dies," arXiv:0903.4888 [hep-th].

[40] B. Freivogel, M. Kleban, M. Rodriguez Martinez, and L. Susskind, "Observational
consequences of a landscape," JHEP 03 (2006) 039, arXiv:hep-th/0505232.

[41] D. Schwartz-Perlov and A. Vilenkin, "Probabilities in the Bousso-Polchinski
multiverse," JCAP 0606 (2006) 010, hep-th/0601162.

[42] D. Schwartz-Perlov, "Probabilities in the Arkani-Hamed-Dimopolous-Kachru
landscape," hep-th/0611237.




[43] K. D. Olum and D. Schwartz-Perlov, "Anthropic prediction in a large toy landscape,"
arXiv:0705.2562 [hep-th].

[44] D. Schwartz-Perlov, "Anthropic prediction for a large multi-jump landscape," JCAP
0810 (2008) 009, arXiv:0805.3549 [hep-th].

[45] R. Bousso and R. Harnik (to appear).

[46] M. Tegmark, A. Aguirre, M. Rees, and F. Wilczek, "Dimensionless constants,
cosmology and other dark matters," Phys. Rev. D 73 (2006) 023505,
arXiv:astro-ph/0511774.

[47] W. H. Press and P. Schechter, "Formation of galaxies and clusters of galaxies by
selfsimilar gravitational condensation," Astrophys. J. 187 (1974) 425-438.

[48] C. G. Lacey and S. Cole, "Merger rates in hierarchical models of galaxy formation,"
Mon. Not. Roy. Astron. Soc. 262 (1993) 627-649.

[49] C. D. Andriesse, "Radiating cosmic dust," Vistas in Astronomy 21 (1977) 107.

[50] WMAP Collaboration, E. Komatsu et al., "Five-Year Wilkinson Microwave
Anisotropy Probe (WMAP) Observations: Cosmological Interpretation,"
arXiv:0803.0547 [astro-ph].

[51] M. Tegmark and M. J. Rees, "Why is the CMB fluctuation level $10^{-5}$?," Astrophys. J.
499 (1998) 526-532, astro-ph/9709058.






